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1.  INTRODUCTION 

A  topic  of  increasing  importance  in  public-sector  management  is  the  design  and  implementation 
of  financial  incentive  systems  that  will  encourage  lower-level  government  units  and  profit-making 
organizations  under  contract  to  these  units  to  use  government  funds  efficiently  and  satisfy  nonfinancial 
government  objectives.  One  such  incentive  system  is  the  Design-to-Cost  (DTC)  system,  implemented 
for  many  major  weapons  acquisition  projects  in  the  Department  of  Defense  (5],  (14]  in  which  a  DTC 
goal  is  established  for  each  project.  Any  deviations  from  the  goal  are  corrected  by  changing  the  perfor¬ 
mance  of  the  weapons  system  or  by  changing  the  number  of  weapons  produced,  or  both.  This  paper 
constructs  a  simple  model  of  the  information,  incentive  and  decision  aspects  of  such  an  incentive  sys¬ 
tem  and  offers  insights  into  the  tradeoffs  and  policy  issues  involved. 

There  is  substantial  literature  on  the  theory  of  contracts  [15],  [8]  and  the  design  of  incentives  [2], 
[6],  [7],  [10],  [9].  However,  an  examination  of  this  literature  discloses  the  need  for  research  in  two 
areas  that  are  essential  to  an  understanding  of  the  weapons  acquisition  process.  The  first  need  arises 
whenever  a  development  effort  precedes  a  production  effort.  The  dynamic  incentive  process— that  is,  a 
multistage  process  in  which  contractor  behavior  during  any  one  stage  is  affected,  by  the  incentives 
operative  during  that  stage,  and  by  an  expectation  of  rewards  or  punishments  in  the  subsequent 
stages— is  not  addressed  by  the  literature.  The  second  need  arises  due  to  lack  of  consideration  of  any 
but  the  most  simple  of  hierarchies.  Yet  in  any  large  government/contractor  effort,  the  government  is 
represented  by  at  least  three  distinct  organizations  (the  Congress,  the  Administration,  and  the  Bureau¬ 
cracy),  and  the  contracting  agent  may  be  represented  by  several  organizations  as  well  (e.g.,  contractors 
and  subcontractors).  This  paper  addresses  both  the  dynamic  and  hierarchical  aspects  of  contracting  in 
the  DTC  context  (see  [12],  [1]). 

The  weapons  acquisition  process  is  viewed  here  as  a  multistage  process  whose  characteristics 
change  substantially  over  time.  The  acquisition  process  consists  of  (at  least)  three  steps:  (1)  a 
development  stage  in  which  two  or  more  contractors  receive  funds  to  design  and  test  a  prototype 
weapon,  at  the  end  of  which  a  single  contractor  is  awarded  a  production  contract;  (2)  a  production  stage 
in  which  the  winning  contractor  produces  one  or  more  copies  of  the  weapon;  and  (3)  an  implementa¬ 
tion  and  maintenance  stage  in  which  the  weapon  is  maintained  and  modified  in  the  field,  often  with 
some  contractor  support.  The  principal  interactions  occur  between  the  first  two  stages  (i.e.,  contractors 
behave  differently  during  the  development  stage  as  the  award  of  the  production  contract  is  uncertain) . 
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Hence,  we  will  confine  our  analysis  to  the  first  two  stages.  We  assume  that  the  contract  during 
development  stage  is  a  fixed-price  contract  and  the  contract  during  production  stage  is  an  incentive  con¬ 
tract  including  (1)  full  cost  recovery,  (2)  a  reward  or  penally  depending  on  the  cost  of  production 
relative  to  a  negotiated  cost  target,  and  (3)  a  reward  or  penalty  depending  on  weapon  performance  rela¬ 
tive  to  a  prespecified  performance  target. 

The  hierarchical  nature  of  the  DTC  system  is  also  emphasized  in  our  research.  We  examine  four 
levels  of  government  and  contractor  hierarchy.  At  the  highest  level,  representing  the  Congress  and  the 
Administration,  DTC  goals  and  allowable  probabilities  of  exceeding  these  goals  are  established. 
(Weapon  cost  and  performance  are  assumed  to  be  random  variables  whose  mean  values  are  controll¬ 
able.)  At  the  second  level,  representing  DoD  and  the  appropriate  military  service,  the  DTC  goal  is  par¬ 
titioned  into  two  subgoals,  one  for  the  development  stage  and  one  for  the  production  stage,  and  the 
contractors  participating  in  the  development  stage  are  selected.  At  the  third  level,  representing  the  mil¬ 
itary  service  and  its  project  managers,  most  of  the  parameters  in  the  incentive  system  are  established 
and  the  production  contract  is  awarded.  (In  our  analysis,  we  assume  that  the  decisions  at  this  level  are 
established  by  decision  rules  known  in  advance  to  both  the  government  and  the  contractors.)  At  the 
fourth  level,  representing  the  contractors,  contract  parameters  are  negotiated  and  the  levels  of  contrac¬ 
tor  effort  (number  of  personnel,  cost  of  raw  material,  etc.)  are  chosen  for  the  two  stages. 

In  the  following  section  a  model  of  the  DTC  incentive  system  is  developed.  The  model  is  solved 
to  determine  the  impact  of  government  decisions  (contractual  incentive  parameters,  the  allocation  of 
the  DTC  goal  between  development  and  production  stages,  and  the  level  of  risk  acceptable  to  the 
government)  and  the  technological  and  market  environment  of  the  project  on  outcomes  such  as  the 
quality  of  the  weapon  produced,  the  cost  to  the  government,  the  profit  of  the  firms  in  the  industry,  the 
risks  assumed  by  the  government  and  the  risks  assumed  by  the  firms.  Some  of  the  policy  implications 
of  the  model  are  then  illustrated  by  a  series  of  examples. 

2.  THE  BASIC  MODEL 

We  consider  a  given  project  and  assume  that  Congress  has  established  a  Design-to-Cost  (DTC) 
goal,  G  for  the  project.  G  is  understood  to  be  a  constraint  on  total  project  cost  and  it  is  assumed  that  G 
may  be  exceeded  only  with  ex  ante  probability  y.  One  might  anticipate  that  DoD  would  set  y  strategi¬ 
cally  to  trade  off  the  transactions  costs  of  exceeding  budgets  and  exposing  itself  to  (re)  appropriations 
hearings  against  the  internal  transactions  costs  which  occur  if  y  is  small. 

We  assume  that  n  firms  have  been  preselected  as  candidates  for  carrying  out  the  project,  in  two 
stages.  In  the  development  stage,  the  n  firms  compete  against  one  another  in  producing  the  best 
design.  In  the  second  stage,  the  firm  with  the  best  first-stage  design  is  awarded  (the  opportunity  to  bid 
on)  a  production  contract.  To  stale  the  problem  precisely  we  need  the  following  notation. 

=  Effort  expended  by  firm  /  in  stage  v.  In  the  development  stage,  s  =  c/,  and  in  the 
production  stage,  s  =  /?; 

=  Quality  (or  performance  level)  achieved  by  firm  i  in  stage  .v;* 

(■„(<■„)  =  Costs  incurred  by  firm  ;  in  stage  s  as  a  function  of  effort  expended. 


*We  assume  ihai  a  single  measure  of  quality  describes  adequately  ihc  performance  of  the  system  and  that  this  measure  is  additive 
across  the  development  and  production  stages  (i.e  ,  that  development  quality  plus  the  quality  added  during  the  production  stage 
equals  total  quality)  In  practice  the  measure  is  multidimcnsjonal.  and  many  of  the  dimensions  appropriate  to  the  development 
stage  are  not  the  same  as  those  appropriate  to  the  production  stage.  The  former  emphasize  mission  requirements,  proper  exploi¬ 
tation  of  new  technology,  etc  The  latter  emphasize  quality  c<mirol,  delivery  schedules,  etc  We  assume  that  the  government  and 
the  contractor  can  agree  on  a  procedure  (e  g  .  the  calculation  of  a  weighted  sum  of  the  various  dimensions  of  quality)  that  will 
result  in  a  single  additive  measure 
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DoD  is  assumed  to  consider  the  following  types  of  contract.  All  development  contracts  are  firm 
fixed  price  contracts  with  each  of  the  n  firms  involved  receiving  GJn  dollars.*  (7^  <  (7  is  therefore  the 
total  development  cost  to  the  government.  The  production  contract,  if  awarded  to  firm  /,  is  assumed  to 
be  a  general  incentive  contract  with  payments  above  costs  to  firm  ( specified  as; 

U)  rip,  (fp,,  fp,,  Q,/)  =  a  Tp,  +  b  (Tp/  -  +  R,(Qd  +  Qp,(epi)), 

where  random  quantities  have  a  over  them,  and  where 

Tp,  =  Target  cost  rate,  negotiated  by  firm  /  at  the  beginning  of  the  production  stage;  Tp,  is 
assumed  constrained  to  be  nonnegative  (negative  bids  are  not  allowed); 

Qj  =  Cumulative  progress  in  quality  of  the  project  during  the  development  stage,  which  is 
the  starting  point  for  the  production  stage; 

a.b  =  Contract  incentive  parameters,  where  a  ^  0,  0  <  ^  1; 

R{q)  =  Performance  incentive  payment,  for  firm  /,  expressed  as  a  function  of  total  quality 

achieved  over  both  stages. 

At  the  end  of  the  development  stage,  DoD  would  have  spent  exactly  Gj  dollars,  leaving 
Gp  =  G  —  Gj  dollars  in  the  overall  project  budget.  Suppose  firm  /  achieves  the  best  performance  in  the 
development  stage,  i.e.,  suppose 

(2)  =  Qd=  Max  Qajie^j). 


We  assume  that  if  (2)  obtains,  then  firm  /  is  given  the  exclusive  right  to  bid  on  a  production  contract. 
In  a  realistic  setting,  one  might  assume  that  more  than  one  of  the  leading  firms  at  the  end  of  the 
development  stage  is  given  the  opportunity  to  bid  on  a  production  contract.  This  possibility  is  excluded 
here.  Thus,  it  is  assumed  that  the  leading  development  firm,  say  /,  is  interested  at  the  beginning  of  the 
production  stage  in  setting  e^,,  a,  and  b  so  as  to  maximize  its  profits  in  development  stage 
Un,^Tp,.  ep,.  Qj)  where 

(3)  Vp.(Tp,.  ep,.  Q,) - 

E\np,{Tp,.  ep,.  Qp)  +  Qpi(ep,))lQ,AeJ  =  Qj, 

where  F.iq^.Qp)  represents  expected  follow-on  benefits  to  firm  /  (e.g.,  in  terms  of  maintenance  con¬ 
tracts,  future  benefits  from  the  technology  developed,  etc.)'*'  n^,  is  given  in  (1),  and  fl,,,  +  Cp, 
represents  total  (incentive  plus  cost)  payments  made  by  the  government  in  the  production  stage. 


*ln  general  one  would  expect  some  son  of  cost  reimbursement  to  lake  place  during  the  development  stage.  However,  we  as¬ 
sume  a  fixed  price  contract  to  contrast  firm  behavior  in  this  stage  with  firm  response  to  government-established  incentives  in  the 
production  stage  In  Sections  5  and  6  we  examine  the  implications  of  this  assumption  — primarily,  that  some  firms  may  decline  to 
participate  in  the  development  process  This  may  occur  even  under  cost  reimbursement  if  some  firms  find  that  their  expected 
profits,  although  nonnegative,  are  less  than  the  profits  that  would  result  from  alternative  earnings  opportunities. 

♦It  was  mentioned  in  the  introduction  that  the  acquisition  process  consists  not  only  of  a  development  and  a  production  stage,  but 
also  an  ongoing  stage  of  operation  and  maintenance,  including  retrofitting  and  occasionally  major  modification.  We  do  not  expli¬ 
citly  consider  this  third  stage,  so  that  we  may  focus  more  closely  on  the  relationship  between  development  and  production 
Thus,  we  are  concerned  here  w  h  acquisition  costs  and  not  with  life  cycle  costs.  However,  some  of  this  may  be  captured  in  the 
quality  measure  and  the  follow-on  benefits.  That  is,  the  firm  that  produces  a  reliable  and  maintainable  system  is  likely  to  develop 
a  reputation  that  will  lead  to  significant  follow-on  benefits.  The  impact  of  these  follow-on  benefits  is  first  explained  on  page  4  and 
IS  discussed  in  more  detail  later  in  the  manuscript.  (See  also  footnote  on  page  7.) 
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Of  course,  firm  i  will  be  subject  to  some  constraints  in  indulging  with  its  preferences  as 
represented  by  (3).  Indeed,  we  assume  that  "a"  is  fixed  in  advance  by  the  government  and  the  follow¬ 
ing  holds  for  variables 

(4)  Pr|n,,(7’p,.c,„0rf)  +  C„(e„)  >  G,\  <  y. 

where  y  is  specified  by  the  Congress  and  the  Administration.  The  fact  that  firms  accept  (4)  as  a  con¬ 
straint,  of  course,  presumes  that  in  the  case  of  costs  overruns,  acceptable  auditing  practices  can  expose 
and  penalize  firms  which  cannot  make  a  credible  ex  post  case  that  (4)  was  observed  in  their  planning. 
This  dependence  of  contractual  incentives  on  (legitimate)  enforcement  and  monitoring  procedures  can¬ 
not  be  overemphasized. 

Beyond  fixing  "a"  and  imposing  (4),  we  will  assume  that  production  contracts  are  negotiated 
through  one  of  two  methodst  (firm  /  is  the  leading  development  firm): 

Method  1,  MV.  "ti'  is  fixed  ex  ante  and  any  satisfying  (4)  will  be  accepted  by  DoD. 

Method  2,  M2:  Firm  /  and  DoD  negotiate  at  the  beginning  of  the  production  stage 

such  that  (4)  is  satisfied  and  such  that  a  Pareto  efficient  point  is  reached  between  firm  /  and  DoD.  The 
preferences  of  firm  /  are  represented  by  (3).  DoD  is  assumed  to  have  preferences  represented  b>  a 
utility  function  UoiC.CO.Q),  where  Q  =  Qj  -V  Qp,{e^,)  is  final  project  quality,  C  =  Gj  +  11,,,  +  is 
total  project  cost,  and  CO  =  C  —  G  is  the  cost  overrun. 

Formally,  we  may  represent  the  two  production  stage  decision  processes  just  described  as  follows: 


(5)  A/l_:  Maximize  (3)  with  respect  to  ( Tj,,. subject  to  (4). 

(5')  Ml:  Maximize  [aUpi{T„,,e„,,Qj)  + 

(1  -a)  EWd^Gj  +  n^,  -I-  Cp„  G,  +  fi^,  +  C,„  -  G.  Q,  + 
subject  to  (4),  Tp,  >  0,  Cp,  >  0  and  0  ^  h  ^  1 , 


where  a  is  between  0  and  1  and  reflects  the  relative  bargaining  power  of  the  contractor  against  DoD, 
Up,  is  defined  in  (3),  n^,  is  given  in  (1),  Cp,  =  Cp,{ep,)  is  the  cost  for  the  production  stage,  and  Q,i  is 
the  observed  realization  of  (2).  We  will  define  the  optimal  solution  value  to  (5)  or  (5')  as  !>,((),/*■  >his 
is  the  optimal  expected  return  for  firm  /  if  the  ending  quality  level  in  (2)  is  Q,i  and  firm  /  is  awa  ded  the 
production  contract. 


Now  consider  the  development  stage.  Each  of  the  n  firms  involved  may  be  assumed  to  maximize 
the  sum+  of  present  benefits  and  expected  follow-on  benefits  (VpiQ.j)  if  firm  /  is  allowed  to  bid  on  the 
production  contract).  Expected  follow-on  benefits  may  then  be  wruten. 


(6) 


Expected  follow-on  benefits  = 


0  if  <  Qj 


'Method  1  was  analyzed  by  McCall  Mil  for  a  sialic  problem  and  neglecting  (41  He  showed  the  possibility  of  a  hi.is  in  l.isor  ol 
inefficient  firms  arising  from  opportunity  cost  considerations  Such  effects  are  largely  ignored  here,  though  bnelly  coiisnlered  in 
the  spirit  of  Canes  [31  and  Cummins  (4|.  who  also  did  not  consider  any  constraints  similar  to  (41 
+We  ignore  dis-  ounting  here  for  notalional  convenience. 
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From  (6)  we  see  that  an  expected  profit-maximizing  contractor  would  solve  the  following  problem 
in  determining  his  level  of  effort  in  the  development  stage: 

(7)  Max  E[(Gjn)  -  Q(c^,)  -t-  . 

‘‘d, 

where  Q,(Crf/)  is  the  cost  incurred  in  stage  d  for  firm  /  and  where  ,  e^„)  equal  to  1  if 

QdMci)  =“  Qd=  Max  {Qdj(e^j)]  and  0  otherwise.  Note  that  the  probability  that  firm  /  is  allowed  to  bid 

on  the  production  contract  (i.e.,  Pr  {Aj  =  1))  depends  on  the  level  of  effort  of  all  the  n  firms  involved. 
Denote  the  optimal  solution  value  in  (7)  by  where  =  (c^i,  . . .  ,  e^^). 

The  final  step  is  the  determination  of  e^.  This  problem  may  be  formulated  as  a  noncooperative 
game,  with  utility  functions  We  are  interested  in  a  Nash  solution  =  ^(G^.n)  to  this 

game,  i.e.,  a  joint  strategy  ^  satisfying 


(8)  ydi(id'Gd.n)  =  Max  . . frfjicd,  >  0), 

for  every  /  €  {l.  ....  n}.* 


Assuming  e^iG^.n)  is  unique  (see  below)  for  each  G^  and  n,  the  random  variables  C,  CO,  and  Q 
are  determined  by  G^  and  n  through  DoD  is  then  interested  in  determining  Gj  (and  possibly  also 
n)  so  that  its  expected  utility  E{U[)(C,CO.Q)]  is  maximized.  If  firm  /  is  awarded  the  production  con- 


tract,  then 

(9) 

c  =  COST  =Gd  +  (rip,  4) 

(10) 

CO  =  COST  OVERRUN  =  C  -  G, 

(11) 

Q  =  QUALITY  =  Qd,  -t-  Qp,. 

Thus,  DoD  wishes  to  set  Gj  (and  possibly  n)  so  as  to 

(12) 

Max  f  ElUo(Gd  +  ftp,  +  Cp,.  (C,  +  ftp, 

0^  Gj^  G  1 

where  all  quantities  are  evaluated  at  eaiGj.n),  e.g., 

0^,  =  Up,  iTp,{Qd,^ed,)h  Pp,(0rf,(Crf,)).  QdAed,)). 

where  Fp,(0rf).e,,(0<,)  are  the  optimal  solution  to  (5)-(5')  for  given  Qp.  Major  problems  occur  in  solv¬ 
ing  (5)-(5')  and  in  obtaining  ejiG^.n).  to  which  we  now  turn. 


3.  SOLUTION- METHOD  1 


In  order  to  obtain  analytical  results,  it  is  necessary  to  make  assumptions  about  the  forms  of  the 
probability  distributions  and  reward  functions.  Specifically,  we  assume  for  each  /  =  1 . n  that: 

1.  Cdi^ej,)  is  random  quantity  with  expected  value  e}). 

2-  QdS^di^  's  exponentially  distributed,  independently  of  {Qdi^^dj^lj  ^  '1  with  expected  value 
Idi^dn  where  q^,  >  0. 


•This  is  a  Nash  equilibrium,  not  a  dominant  strategy  equilibrium  In  other  words,  if  all  but  one  of  the  players  should  adopt  the 
equilibrium  strategy  and  the  one  player  should  depart,  his  profits  will  be  reduced.  However,  if  two  or  more  players  should  col¬ 
lude  (by  departing  from  the  Nash  equilibrium  and  sharing  their  profits),  they  might  each  be  able  to  realize  higher  profits.  This 
behavior,  which  would  almost  certainly  constitute  conspiracy  to  defraud  the  government,  is  not  considered  in  our  model.  In  ad¬ 
dition.  we  assume  that  each  competitor  for  the  production  contract  is  aware  of  the  technological  capabilities  and  the  values  (e  g., 
for  follow-on  benefits)  of  the  other  competitors 
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3.  Cpiieni)  and  Qpi(epi)  are  jointly  normal  with  respective  means  e^j  and  qpiepjiqpi  >  0),  respective 
variances  o-j,  and  and  with  positive  correlation  coefficient  8^;. 

4.  Ri(Q)  =  0,  i.e.,  performance  incentive  payments  are  nil. 

5.  FtiQd’Qp^  =  Hi  +  hjiQj  +  hpiQp,  where  >  0,  hp,  >  0  are  constants. 

For  this  data  we  may  write  (5)  as 

(13)  Max  [(a  +  b)  -  be^i  +  H,  +  hj,Qd  +  hp,qp^ep,] 

subject  to: 

(14)  Pr{(a  +  b)  Tp,  -  b  CpMp.)  +  CpMpi)  >  Gp]  <  y. 

Collecting  terms,  (14)  may  be  rewritten  as: 

(15)  Pr{[(l-())C^,(e^,)l  ^  (G^-  (a +  6)7'^,])  <  y. 

Since  Cp,  is  normal,  {\  —  b)Cpi  is  also  normal  with  mean  {\  —  b)e^i  and  variance  (1  — 6)Vp,  so  (15) 
may  be  expressed  as 

(16)  [(1-6)^’;,^  +  (a  +  b)Tpi  -  Gp\  +  {\- bUp^Kiy)  <  0 

where  K  (y)  is  the  (1  -  y)"’  fractile  of  the  unit  normal,  i.e.,  Pr(/V(0, 1)  >  A"  (y))  =  y. 

Define  kpi(y.b)  through 

(17)  kp,(y.b)  ^  K(y)(l-b)a-p,. 

Then  (16)  becomes 

(18)  [i\-b)ep^i  +  (a  +  b)  Tp,  -  Gp]  ^  -  kp^iy.b). 


Thus,  the  constraint  (14)  may  be  written  as  (18).  Since  6  <  1,  we  see  that  (18)  defines  a  convex 
region  for  every  value  of  Q^.  Note  also  that  dkpJBy  <  0  and  dkpjdb  <  0.  Thus,  as  y  or  decreases 
the  constraint  region  becomes  larger.  Similarly,  as  Qj  decreases  the  constraint  region  becomes  larger. 


To  find  the  optimal  in  (13),  note  that  whatever  is,  the  optimal  Tp,  will  be  set  so  that 

(18)  holds  as  an  equality  since  otherwise  firm  /  could  simply  increase  Tp,  with  consequent  higher  profits. 
Solving  for  (a  +  b)Tpi  in  (18),  we  see  therefore  that,  at  the  optimum, 

(19)  (a  +  b)Tp,  =  Gp  -  kpSy.b)  -  (\-b)e^,. 

Thus,  substituting  in  (13)  for  (a  +  b)  Tp,,  the  following  problem  characterizes  the  optimal  e^,. 

(20)  Max  l-e^i  -  kp,(y,b)  +  Gp  +  H,  +  +  hpiqp^ep,]. 

subject  to  Cp,  ^  0  and  Tp,  ?  0.  Using  (19),  the  nonnegativity  constraint  on  Tp,  may  be  expressed  in 
terms  of  Cp,  as 

(21)  (\-b)e},  ^  Gp- kp,{y.b). 


Thus,  the  problem  of  interest  is  to  maximize  (20)  subject  to  Cp,  ^  0  and  (21). 
programming  problem  has  the  solution 


(22) 


Min 


^pi  Qpi 

Gp  -  kp,(y.b) 

1 

2 

2 

\-b 

This  simple  quadratic 
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and  the  optimal  target  cost  Tp,  is  therefore  determined  by  (19)  as 

ia  +  b) 

Finally,  the  optimal  value  of  the  objective  function  in  (20)  (respectively  in  (13))  is  obtained  by  substi¬ 
tuting  Cpi  for  epi  in  (20).  This  yields 

(24)  =  Kp,  +  hj.Qj. 

where  Kp,  is  independent  of  Qj  and  is  given  explicitly  by 

(25)  Kpi  =  Gp-  kpi(y,b)  +  H,  -  +  hp,qp,ep,. 

Notice  from  (22)  that  firm  /  will  expend  only  the  minimum  effort  (here  ep,  =  0)  in  stage  P  under 
Method  1  contracting  unless  there  is  some  promise  of  follow-on  rewards  from  such  effort  (i.e.,  unless 

V  >  0).* 

From  (7)  and  (24)  we  see  that  firm  /  solves  the  following  problem  in  determining  its  level  of 
development  effort  ej,: 

(26)  Max  E{(GJn)  -  cj  +  [Kp,  +  /id/0d/(ej,)]^,(erf,,fi,n)), 

where  =  (c^,,  ej„).  We  have  used  the  assumption  in  (26)  that 

£{Q,(crf,)l_=  eji  and  also  the  fact  Ai{ej,n)  =  1  precisely  when  firm  i  achieves  the  maximum  in  (2); 
otherwise  A,iea.n)  =  0. 

We  first  evaluate  the  following  expression  in  (26): 

(27)  EP  =  E{[Kp,  +  h,,Q„(eJ\A,(e,„^,n)]. 

££ represents  the  expected  returns  from  the  production  stage  as  seen  by  firm  (at  the  beginning  of  stage 
d. 

We  first  note  from  (2)  that 

(28)  PrlAfiej.n)  =  1}  =  ^  QdAe,/,)]  for  all ./  =  1 . n}, 

or  using  the  assumed  independence  of  {0^;  !./  =  1 . ") 

(29)  Pr{^’,(^,n)  =  U  =  O  Prla/(ea/)  <  QM). 

/=! 

Thus,  if  Fjiiq.ej,)  =  Pr{0, //?,/,)  <  q}  is  the  cumulative  distribution  function  of  Qdjicji)-  we  may  write 

(29)  as 

(30)  Pr{A,{e,,n)  =  1|  =  f]  WJej,),e„). 


"This  striking  result  can  be  explained  quite  simply.  If  a  firm  is  not  concerned  with  follow-on  benefits  and  is  not  directly  rewarded 
for  quality  performance,  it  has  no  incentive  to  expend  more  than  the  minimum  required  effort  during  the  production  stage.  This 
behavior  cannot  be  altered  by  changing  the  method  of  competition  for  the  production  contract,  but  rather  by  (1)  incorporating 
minimum  acceptable  levels  of  effort  or  quality  into  the  production  contract.  (2)  rewarding  the  firm  directly  for  deliverying  a  quali¬ 
ty  system,  and/or  (3)  encouraging  the  firm  to  believe  that  there  are  follow-on  benefits  of  producing  a  quality  system.  See  also 
the  article  by  General  Tashjian  (161,  who  argues  that  in  a  two-stage  (development  and  production)  contract  the  government 
might  motivate  contractors  to  meet  quality  standards  and  the  design-to-cost  goal  if  it  states  in  the  development  contract  its  inten¬ 
tion  to  cancel  the  program  if  the  design-to-cost  goal  is  not  met.  As  we  point  out  in  Section  6.  determining  the  perceptions  of 
contractors  concerning  future  government  actions  (and  hence,  their  follow-on  benefits)  is  an  important  topic  for  future  research. 
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Finally,  using  (30),  (27)  becomes 

(31)  EP  =  J  ^  ([A:^,  +  Ej,{x.c^,))/j,(x.e,,,)dx. 

where  /^/(x.e^,)  is  the  probability  density  function  of  Qj,(e,,,). 

In  the  exponential  case  considered  here,  (31)  becomes 
1 


(32) 


EP  = 


r 

[A;,,  +  p[ 

1  -  exp 

-V 

exp 

A' 

‘/d,i’,i, 

dx. 


Restricting  attention  to  «=  1  or  2,  we  obtain 
1 


(33)  EP(n=\)  = 
and  setting  j  ^  / 

(34)  EP(n  =  2)  = 


J,  oo 

„  +  hj,x]  exp 


dx  =  A' ,  +  hj,qj,e 


r 


[  A”^,  +  hjf  Qji  Cji  1 


IA„,  +  hj,x\ 


1  -  exp 


exp 


dx 


ddi^dt 


Qj/^di  +  ddi^d, 


Comparing  (33)  and  (34),  it  is  interesting  to  note  that  for  any  given  level  of  effort  during  the  develop¬ 
ment  stage  the  ex  ante  expected  returns  from  the  production  stage,  which  we  denoted  above,  are 
less  for  firm  /  if  2  firms  compete  for  the  production  contract  than  if  firm  /  alone  is  guaranteed  the  pro¬ 
duction  contract. 

Now,  given  (33)-(34),  we  may  easily  solve  (26)  for  the  optimal  development  effort  e,/,,  assuming 
the  other  firm’s  effort  fixed  at  e^,. 

When  n  =  1,  of  course,  there  is  no  other  competing  firm  and  substituting  (33)  in  (26)  yields  the 
following  as  the  appropriate  problem  for  firm  /  (if  firm  / is  the  only  development  firm): 


(35) 


Max  [Gd  -  ej,  +  K„,  +  hjiq,„eJ. 


which  has  the  unique  solution 

(36)  e,,, - — 

yielding  overall  profits  for  firm  /  of 


(37) 


^dS^di-Gd)  =  Gd  +  Kp,  -f 


I’d.dd, 


(38) 


When  n  =  2,  matters  are  more  complicated.  Substitution  of  (34)  in  (26)  yields 

^dl  ^*dl 


Max 


[(GJ2)  -  e,l  +  [A'p,  +  /id, i/d, (’d, I 


(/.iii’di  +  ‘/d,<‘.i, 


Taking  first-order  conditions  in  (38),  while  assuming  cj,  fixed,  we  obtain 

A/,/  qj,  dti,  i’d. 


(39) 


e,y,  = 


(2A^  -  /t,/,(/i(A  -t-  v,/,<V,)l 
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where 

(40)  4k  =  qjii’ji  + 


We  seek  a  Nash  solution,  defined  by  (8),  which  would  be  a  simultaneous  solution  to  (39)  and  tl^ 
corresponding  equation  for  firm  ./,  i.e.,  to  (39)  and 

j  j  c  —  _ _ 

(2A‘ -  /).,,qi,(A  +  q.y,o,)l 

Assuming  A  fixed,  and  /),*,  h,/,  >  0,  the  simultaneous  solution  to  (39)  and  (41)  is 
e„(A)  = 


(42) 

(43) 

where 

(44) 


-\ 


(A(2A  -  hj,qj,)lu,qj,ql  +  hj,K^,q],qj, 
-A 


[A(2A  -  hj,qJi)li,,,qJ,qj,  +  hjiKp,qJ,qj,] 

A  =  [K^,K^,q},ql  -  (2A  -  hj^q},)  (2A  -  hj,ql)^^. 


Now,  from  (40)  the  Nash  solution  =  e,/(A)  we  seek  must  clearly  satisfy  (42)-(43)  and 
(45)  </,/iea|(A)  +  qj2edi^i)  =  A- 

Thus,  multiplying  (42)  (respectively,  (43))  by  </,/,  (respectively,  qjJ  and  adding  the  results  leads 'te' 
(45),  which  in  general  is  a  polynornial  of  degree  6  in  the  variable  A.  Numerical  solution  procedures 
easily  yield  A  in  general,  and  once  A  is  obtained  so  also  is  the  desired  Nash  point  ej  from  (41)-(42). 
from  which  all  other  desired  information  may  be  obtained.  In  this  paper  we  will  not  proceed  further 
with  the  general  case.  However,  we  discuss  two  cases  which  may  be  solved  analytically. 


CASF  1:  hj,  =  0  for  all  /.  In  this  case  (39)-(41)  can  be  solved  directly  to  yield 

(46)  e„  =  tJk;,.  =  Tv^, 
where 

(47)  f  —  (^/7|^;i2('^‘' 

In  this  case  it  can  be  shown  that  deji/dqji  has  the  same  sign  and  the  opposite  sign  of 

-  ?<// As  expected  beJbK^,  >  0  always  holds. 


CASE  2:  Two  identical  firms.  When  q^,  =  qj,  =  q,/,  //,/,  =  =  /i^,  and  =  Kp.  we  can 

again  solve  (39)-(41)  explicitly,  obtaining 

4-  ^9/iJqJ  +  32Kp 

(48)  (Vi  =  (V2  =  - - - • 

Here  all  the  relative  change  effects  are  obvious  and  in  the  expected  (positive)  direction.  An  interesting 
point  to  note  from  (48)  (or  (46)-(47))  is  that  when  h,/  =  0,  the  amount  of  effort  expended  in  develop¬ 
ment  is  independent  of  quality. 

This  concludes  our  discussion  of  Method  I  contracting  (see  (5)).  Before  considering  further  the 
government's  problem  in  this  regard,  let  us  turn  our  attention  briefly  to  Method  2  contracting  (see 
(5')). 
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4.  SOLUTION- METHOD  2 

We  continue  to  make  the  cost  and  distributional  assumptions  1-5  of  the  previous  section  In 
Method  2  contracting  the  stage  n  behavior  of  the  production  contracting  firm,  say  /.  is  determined  as  a 
solution  to  (5'),  except  that  we  further  restrict  b  so  that  h  ^  b  0,  with  b  being  some  minimal  shar¬ 
ing  rate  set  by  Congress.*  We  assume  the  DoD  utility  function  is  specified  linearly  as 

(49)  Uo(C.CO.Q)  =  -.Cl  C  -  ,c'2  CO  +  m  Q. 

where  ^,  >  0,  /  =  1 . 2,  3,  Then,  for  given  a  €  (0.1 ),  we  may  write  the  problem  (5')  as  follows; 

(50)  Maximize  EV  =  aE  (H^,  -I-  -F  (I  -  a)  E{U„{C.  CO.Q)\Q,i,  =  Qj\ 

=  a[(a  -h  b)  Tp,  -  be/, 

+  (M,  +  h,i,Q,i  -F  bj„Qi„ei„)] 

+  (1  —  a)  [— A'|((j,/  +  Cln,,,  -F  C^,(c,„)l) 

-  -F  £  {n^,  -F  Cp,(<>^,)|  -  G) 


Subject  to:  (4)  and  b  ^  b  ^  1. 

Note  that_  the  expected  total  project  cost  (to  the  government)  and  quality  (given  Qj)  are,  respectively. 
Gd  +  £|n,„  +  c^,(c,„)l  and  Qj  +  ElQf„(ep,)}  =  Qj  +  qp,e^,.  Now  we  note  that 

(51)  £(n„  +  Cp.iep.)]  =  («  +  b)Tp,  -F  (I  -  b)e/. 

Now,  under  our  assumptions,  (4)  may  be  rewritten  in  the  form  (18).  Moreover,  as  in  Section  3. 
it  may  be  shown  here  that  for  any  fixed  b  ^  [b.\]  the  solution  to  (50)  is  on  the  boundary  of  the  con¬ 
straint  set  (18)  provided  only  thaff 


I  +  +  ,k2  ' 

Condition  (52)  may  be  viewed  as  a  lower  bound  on  the  bargaining  power  of  firm  /.  We  henceforth 
assume  (52)  so  that  (4)  (i.e.,  (18))  holds  as  an  equality.  Just  as  in  Section  3.  we  can  now  substitute 
(19)  in  (50)  to  obtain  the  final  problem  of  interest: 

(53)  Maximize  (-ae/,  +  [«/?,„  -F  ( 1  -  a  ),y, ](/„,<>,  +  Qjla/i,,,  -F  (!-«).?,)  -F  TVib)). 


Subject  to:  Tp,  ^  0.  Cp,  ^  0.  b  ^  b  ^  \ , 

where  the  term  TUis  independent  of  Cp,  and  Q,,  and  is  given  by 
(54)  rU(A)  =  a//,  -  (l-a)A',G;/ 

-F  [a  -  (1  -  a)^r|lG'^ 

-F  [(1  -a)  (.gi  -F  g2(  ~  oi]kp,{y.b). 

‘See  also  Canes  (31,  for  a  similar  assumption  and  a  discussion  of  some  rationale  for  establishing  such  a  lower  bounding  sharing 
rate 

rWhen  (52)  does  not  hold,  the  solution  to  (51)  appears  to  be  somewhat  complicated  as  the  solution  need  no  longer  be  on  the 
boundary  of  (18)  Details  for  this  more  general  case  have  not  yet  been  worked  out, 
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We  may  first  note  that  (52)  implies  [or  —  (l-aKjei  +  ;e2^1  ^  0-  coupled  with  (see  (17)) 

dkpjdb  <  0  implies  that  the  optimal  solution  for  ft  in  (53)  is  ft  =  ^  (note  that  the  only  term  containing 
ft  is  la  -  (1  -aHjCi  +  kpiy.h).  To  obtain  we  take  first-order  conditions  in  (53)  and  find 

(/,  —  kjjy  ,h) 

1  -  ft 


(55) 


ep,  =  Min 


loft,.,  +  <1  -o)^j  !</,,, 


2a 


and  fp,  is  again  found  by  substituting  Cp,  and  ft  =  ft  into  (19)  to  obtain 


(56) 


[(a»  -  kpiy.b) 


<l-ft)c,.-l 


(a  +  ft) 


Substituting  ft  =  ft  and  in  (551  into  (53).  we  see  that  Method  2  leads  to  exactly  the  same  form 
of  solution  value  (see  24)  as  Method  1  (where  the  denotes  Method  2  values): 

(57)  t;,(g,)  =  A-;,  + 
where  for  Method  2 

(58)  Kpi  =  —aVp,  +  [ahp,ilp,  +  (I  ~  a  l.e-.r/,.,)*',,,  +  /•  fft) 
and 

(59)  ftj,  =  [aftj,  +  (1  —al.Cil 

From  this  we  see  that  the  solution  procedure  and  results  for  Method  1  in  stage  </  are  completely 
transferable  to  Method  2,  with  Kp,  and  ft,;,  substituted  everywhere  for  and  ft,,,. 

Before  closing  our  analysis  of  Method  2  it  is  of  interest  to  note,  comparing  (22)  and  (55).  that 
effort  expended  in  the  production  stage  is  always  greater  under  Method  2  than  under  Method  1  con¬ 
tracting.  More  detailed  comparative  analysis  of  the  other  parameters  and  decisions  will  be  explored  in 
the  next  section  via  numerical  analysis. 

5.  ILLUSTRATIVE  RESULTS 


We  illustrate  the  concepts  and  results  of  the  previous  sections  with  a  numerical  example,  solved 
in  APL  on  the  DEC  System  10  at  The  Wharton  School.  We  analyze  the  impact  of  the  following  param¬ 
eters  on  the  behavior  of  the  firm  and  on  the  outcome  of  the  project;  variations  in  the  risk  sharing 
parameter  (b)  and  partitioning  of  the  (fixed)  total  government  budget  between  the  production  budget 
(Gp)  and  the  development  budget.  We  consider  three  industry  configurations:  two  identical  firms,  two 
firms  with  different  levels  of  productivity,  and  a  single  monopolistic  firm.  The  values  of  the  parameters 
used  in  these  experiments  are  given  in  Table  1.  One  can  interpret  these  figures  by  assuming  that 
money  is  measured  in  dollars  and  that  quality  is  measured  in  miles  of  range  of  the  weapon  (e  g.,  an  air¬ 
craft  or  missile).  Simulations  are  run  for  Method  1  and  for  Method  2  with  a  =  .8  and  a  =  .9.  .A  sam¬ 
ple  of  the  output  for  the  two  identical  firms  with  Method  1  appears  in  Table  2.  The  remaining  analy  ses 
are  based  on  similar  outputs  for  the  other  cases. 

The  impacts  of  the  negotiation  process  (b)  and  budget  allocated  to  production  (Gp)  on  costs,  qual¬ 
ity,  and  the  mean  and  variance  of  profit  are  shown  in  Table  3.  These  relationships  are  identical  across 
all  three  industry  structures.  The  mean  and  variance  of  cost  to  the  government  is  the  same  for  Method 
1  and  Method  2,  whatever  the  value  of  a.  However,  the  expected  quality  and  the  expected  cost  to  the 
firms  are  higher  for  Method  2  than  for  Method  1,  and  within  Method  2  they  are  higher  when  a  is  at  its 
lower  value.  In  addition,  as  the  expected  cost  of  the  firm  increases  from  method  1  to  Method  2.  the 
expected  profit  (including  intangibles)  decreases.  These  effects  occur,  because  Method  2  gives  the 
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TABLH  3  —  ComiHirison  o!  Methods  for  Am  Industrv  Structure 


Valid  for  any  value  of  band  G,, 

Variable 

Method  1 

Method  2 

a  =  .9 

a  =  .8 

Expected  Cost  to 
the  Government 

.Same 

Same 

Same 

Expected  Total  Oualiiy 

Low 

Medium 

High 

Expected  Cost  to 
the  Firm 

Low 

Medium 

High 

Expected  Profits 
of  the  Firm 

High 

Medium 

Low 

Intangible  Reward 

Low 

Medium 

High 

Variance  of  Profit* 

High 

Medium 

Low 

Variance  of  Cost  to 
the  Government 

Same 

Same 

Same 

_ 

‘l  or  the  nionopolisiic  induMry  siruciuro,  this  variable  is  constant. 


go\ornmcni  more  byrgainning  power  ihan  Method  1,  and  this  bargainning  power  increases  with 
deeroasing  a.  Thus,  we  obtain  highest  expected  cost  of  firm  and  lowest  expected  profit  at  a  =  0.8. 

The  impact  of  the  risk  sharing  parameter  (b)  and  the  portion  of  the  budget  allocated  to  production 
(G.,  I  on  those  variables  are  shown  in  Table  4.  With  regard  to  the  risk  sharing  parameter,  the  results  are 
what  one  would  expect,  with  one  exception.  As  h  increases,  for  any  industry  structure,  the  develop¬ 
ment  and  production  efforts  of  each  firm  increases  as  long  as  the  production  target  of  the  firm  is  zero. 
The  production  effort  remains  constant,  with  increasing  b,  once  the  target  becomes  positive.  That  is.  as 
h  decreases,  each  firm  attempts  to  respond  by  decreasing  its  target  without  changing  its  production 
cfl'ori.  But  as  the  target  is  constrained  to  be  nonnegative,  the  firm  meets  the  design-to-cost  goal  by 
decreasing  its  production  effort.  We  also  observe  that  when  industry  structure  is  monopolistic,  develop¬ 
ment  effort  is  independent  of  h.  This  occurs  because  the  monopoly  firm,  assured  of  the  contract,  puts 
forth  minimal  effort  at  development  stage  to  obtain  intangible  follow-on  benefits.  As  a  result,  quality 
of  the  weapon,  cost  to  the  government,  cost  to  the  firm,  profit  of  the  firm  (including  intangible 
rewards),  intangible  rewards,  and  target  increase  (weakly)  with  increase  in  b.  In  addition,  the  variance 
of  the  government  cost  decreases  with  increasing  b,  since  the  firm  assumes  more  risk.  However,  the 
variance  of  the  firm's  profit  also  decreases  as  the  firm  assumes  increasing  risk  and  this  is  an  interesting 
result. 


The  explanation  of  this  counterintuitive  result  arises  partly  from  the  fact  that  production  c.’st  and 
quality  are  correlated  (which  introduces  a  negative  term  in  the  variance  calculation  whose  derivative 
ma>  be  dominant)  and  partly  from  the  assumptions  and  parameter  values  used  in  these  experiments. 
We  begin  by  noting  that  the  variance  of  profit  (including  intangibles)  for  the  firm  is  given  by 

Vardl,)  =  +  hj,q},e},  +  Var(C,/,) 


and  thus. 


J  Vardl,) 

Jb 


(/Pa. 


=  Ibirj,,  -  I-  2h,f,qj,ej,--j^  + 


rfVar(Q,) 

db 


I  he  variance  of  profit  to  the  firm  will  increase  or  decrease  with  has  the  above  result  is  positive  or  nega¬ 
tive  f  or  the  parameter  values  used  in  these  experiments,  the  result  will  always  be  negative  as  long  as 
w,  the  cocITicient  of  variation  of  C\,,.  is  below  ,7  and  will  always  be  positive  for  «>  >  2.  For  intermedi¬ 
ate  values,  the  variance  of  II,  will  decrease  for  low  values  of  b  and  will  thereafter  increase. 
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TABLE  4  —  Impact  of  b,  G^,  and  Industry  Structure  on  Variables 


Effects  on  variables  as  b  and  Gp 

change 

As  b  Increases 

As  Gp  Increases 

Variable 

Different 

Identical 

One 

Different 

Identical 

One 

Firms 

Firms 

Firm 

Firms 

Firms 

Firm 

Expected  Total 

Quality 

INC 

INC 

LIM 

INC 

INC 

LIM 

Expected  Cost  to 
the  Government 

INC 

INC 

INC 

SAME 

SAME 

SAME 

Expected  Cost  to 
the  Firms 

INC 

INC 

LIM 

INC 

INC 

INC 

Expected  Profit 
of  the  Firms 

INC 

INC 

INC 

CHG 

CHG 

LIM* 

Expected  Intangible 
Rewards 

CHG 

INC 

LIM 

CHG 

INC 

LIM 

Production  Target 

INC 

INC 

INC 

INC 

INC 

INC 

Expected  Quality 
at  Development 

INC 

INC 

SAME 

INC 

INC 

SAME 

Expected  Quality  at 
Production 

CHG 

LIM 

LIM 

CHG 

LIM 

LIM 

Variance  of  Cost 
to  the  Government 

DEC 

DEC 

DEC 

SAME 

SAME 

SAME 

Variance  of  Profit 
of  the  Firm 

_ 

DEC 

DEC 

DEC 

INC 

INC 

SAME 

*This  applies  only  to  Method  1. 

For  Method  2,  the  profit  increases  and  then  decreases. 

LEGEND: 

INC:  increases 

DEC:  decreases 

SAME:  no  change 

CHG:  increases  for  small  values,  followed  by  decrease 

1  LIM:  increases  until  Tp  becomes  positive,  followed  by  no  change 

We  also  note  that  the  variance  of  cost  to  the  government  is  given  by  (1  —  b)^^(Tj,p,,  where  p,  is 

I 

the  probability  that  the  /"'  firm  will  receive  the  production  contract.  When  o-p,  is  independent  of  / 
which  is  the  case  here,  the  variance  of  cost  to  the  government  is  (1  -  and  this  will  always 

decrease  with  b. 

Because  increase  in  the  risk  sharing  parameter  bring  about  strict  increases  in  total  cost  and  total 
quality,  risk  sharing  allows  the  government  to  obtain  a  cost/quality  trade-off  consistent  with  its  goals. 
However,  one  may  expect  that  beyond  a  certain  point,  quality  increases  slowly  with  h  (and  in  the  case 
of  the  monopoly  firm,  will  not  increase  at  all),  while  cost  continues  to  increase  proportional  to  increas¬ 
ing  h.  This  is  the  point  where  the  target,  or  one  or  more  of  the  targets  for  nonidentical  firms,  becomes 
positive. 

The  situation  is  more  complex  when  the  total  government  budget  is  partitioned  between  the 
development  and  production  stages.  We  note  that  total  quality  increases  as  Gp  increases,  because  an 
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increase  in  6,,  relaxes  the  design-to-cost  constraint.  But,  expected  cost  to  government  and  the  variance 
of  that  cost  remain  constant.  Thus,  the  incentive  to  the  government  is  to  make  Gp  as  large  as  possible. 
However,  there  may  be  a  limit  to  the  size  of  that  is,  it  may  not  be  possible  to  let  G,,  equal  G— 
because  the  firms  may  refuse  to  compete  due  to  inadequate  compensation  during  the  development 
stage.* 


With  regard  to  industry  structure,  in  many  cases  variables  such  as  cost,  quality,  and  profit  have 
the  same  relationship  to  each  other  for  all  values  of  h  and/or  for  all  values  of  G,,.  This  is  illustrated  in 
Table  5.  The  government  receives  the  highest  quality  weapon  at  the  same  or  at  lower  cost  when  two 
different  firms  are  competing  for  the  contract,  whereas  it  receives  the  lowest  quality  system  at  the  same 
or  at  higher  price  when  the  industry  structure  is  monopolistic.  This  is  an  expected  result.  Expected 
profit  is  highest  for  the  monopoly  firm  and  is  lowest  when  the  industry  consists  of  two  identical  firms. 
The  two-firm  industry  receives  more  profit  when  the  firms  are  different,  because  one  of  the  two  firms  is 
less  productive  than  the  other,  and  the  decrease  in  expected  profit  for  this  firm  is  not  offset  by  the 
increase  in  expected  profit  for  the  more  productive  firm.  In  general,  one  would  expect  the  benefits  of 
competition  to  diminish  as  the  quality  of  the  inefficient  firm  decreases.  When  the  inefficient  firm  has 
low  quality,  the  efficient  firm  will  not  perceive  a  credible  competitive  threat.  Sc.  increasing 
government’s  expenses  on  development  does  not  repay  in  increased  quality. 


TABLE  5  —  Impact  of  huhistry  Striiciitrc  on  Variables  for  all  Values  of  b  and  of  Gp 


For  all  values  of  b 

For  all  values  of  Gp 

Two 

Two 

One 

Firm 

Two 

Two 

Variable 

Different 

Firms 

Identical 

Firms 

Different 

Firms 

Identical 

Firms 

Firm 

Expected  Quality 

H 

M 

L 

H 

Expected  Cost  to 
the  Government 

S 

S 

H 

S 

HI 

Expected  Cost  to 
the  Firms 

H 

M 

L 

F 

Expected  Profit 
of  the  Firms 

M 

L 

H 

M 

L 

1 

H 

Intangible  Reward 
(Method  1) 

M 

L 

M 

H 

M 

L 

Intangible  Reward 
(Method  2) 

H 

M 

L 

F  1 

1 

F 

1 

L 

Variance  of  Profit 

H 

M 

L 

H 

M 

L 

Variance  of  Cost 
to  the  Government 

S  1 

_ ^ 

S 

S 

S 

S 

S 

LEGEND. 

H; 

high  value 

M; 

medium  value 

L: 

low  value 

S: 

.same  value 

F; 

fluctuates 

*We  assume  lhal  lirm.s  have  an  inccnlive  lo  parliijpatc  in  Ihe  development  stage,  even  if  their  expected  prolil  is  negative  or  is 
below  their  alternative  earnings  opportunities.  Eor  example,  the  firms  may  wish  to  "slay  in  the  game"  and  remain  visible  to 
DoD.  a  type  of  follow-on  henelil  not  quanlitied  in  our  model  However,  within  ihis  constraint  they  wish  to  maximize  expected 
profit  plus  follow-on  benefits,  which  will  lead  the  winner  of  the  production  contract  to  minimize  his  effort  during  the  production 
stage  when  these  follow-on  benefits  are  nonexistent  li  e.,  when  hp,  -  01  As  we  note  in  Section  6,  examining  the  effects  of  re¬ 
moving  Ihis  assumption  is  an  important  topic  for  future  researcii 
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The  results  described  above  are  based  on  the  assumptions  that  all  firms  in  the  industry  will  coni- 
pete  for  the  production  contract  regardless  of  the  expected  profits  and  that  the  government  will  allow  all 
firms  in  the  industry  to  compete  (by  paying  a  fixed  cost  for  development)  regardless  of  the  productivity 
of  the  firm.  These  assumptions  do  not  give  rise  to  anomalies  within  the  parameter  ranges  used  here, 
contrary  to  expectation.  For  example,  we  observe  that  an  increase  in  Gp  (and  a  corresponding  decrease 
in  the  funds  paid  to  the  firms  for  their  development  efforts)  results  in  an  increase  in  quality  with  no 
change  in  the  expected  cost  to  the  government,  and  also  results  in  an  increase  in  expected  cost  to  the 
firms,  regardless  of  industry  structure.  However,  in  reality,  most  firms  have  alternative  uses  for  their 
resources  (current  and  fixed  assets,  experienced  managers,  skilled  workers,  etc.),  and  some  of  them 
may  decline  to  participate,  once  their  expected  profits  do  not  compare  favorably  with  those  obtainable 
elsewhere.  In  fact,  the  existence  of  a  negative  constant  term  Kpiiy.b)  in  (23)  can  result  in  negative 
expected  profits  for  suitable  contract  parameter  values.  This  will  almost  certainly  cause  a  firm  to  with¬ 
draw  from  participation  at  the  development  stage. 

Generally,  the  selection  of  government  contract  parameters  (Gp,  y,  6,  and  a)  must  be  compatible 
with  the  alternative  earning  opportunities  of  the  firm.  Such  alternative  market  opportunities  determine 
a  set  of  contract  parameters  for  each  firm  at  which  the  firm  would  be  willing  to  participate  in  the 
development  effort  and  compete  for  the  production  contract.  The  government  will  desire  a  possibly 
different  set  of  values  for  the  contract  parameters,  for  which  the  cost  is  low,  variance  of  cost  is  low  and 
the  quality  is  high.  The  government  must  choose  its  values  of  Gp,  y,  b,  and  a  from  the  set  of  contract 
parameter  values  which  will  guarantee  prospective  contractors  earnings  opportunities  as  attractive  as 
their  alternative  market  opportunities. 

Figure  1  illustrates  an  example  of  this  tradeoff.  The  figure  shows  a  situation  where  the  efficient 
and  inefficient  firms  expect  returns  higher  than  $102,300,000  and  $62,800,000,  respectively.  This 
expectation  leads  the  firm  to  accept  b  and  Gp/G  only  in  specified  ranges  (as  shown  by  the  shaded  area). 
The  government  has  to  choose  its  acceptable  6  and  Gp/G  depending  on  budget  allocation,  risk  sharing, 
and  other  parameters.  If  these  parameter  values  of  the  government  fall  in  the  shaded  area,  the  govern¬ 
ment  can  expect  the  firms  to  bid  for  the  project.  If  not,  the  government  may  have  to  revise  its  policy. 

6.  CONCLUSIONS  AND  FURTHER  RESEARCH 

We  have  examined  the  impact  of  dynamics  and  hierarchy,  including  industry  structure,  on 
government  and  industry  behavior  in  a  design-to-cost  context,  using  a  model  of  the  weapons  acquisition 
incentive  process.  Most  of  our  results  are  a  quantitative  verification  of  what  we  would  qualitatively 
expect.  As  risk  sharing  increases,  the  firms  put  forth  more  effort  (except  for  the  monopolistic  firm  dur¬ 
ing  the  development  stage)  and  produce  a  better  quality  weapon  at  higher  cost.  The  major  counterin¬ 
tuitive  result  is  that  the  variance  of  the  government’s  cost  decreases,  and  for  some  values  of  the 
economic  and  technological  parameters,  the  variance  of  profit  of  the  firm  decreases  as  the  risk  sharing 
by  the  firm  increases.  In  addition,  the  government  receives  a  higher  quality  weapon  for  a  fixed  budget 
when  it  deals  with  competitive  firms  than  it  does  when  it  must  deal  with  a  single  monopolistic  firm,  and 
within  limits,  the  quality  increases  when  bidding  firms  are  diverse  in  their  capabilities.  The  government 
also  receives  a  higher  quality  product  when  it  invests  a  higher  percentage  of  its  budget  in  production 
(relative  to  development),  unless  the  shift  in  investment  causes  some  firms  to  withdraw  from  the  com¬ 
petitive  development  phase  of  the  acquisition  process. 

The  interaction  between  government  policy  and  industry  structure  suggests  a  productive  direction 
for  further  research.*  Government  policies  influence  industry  structure  and  also  affect  the  structure  of 

*We  are  grateful  to  the  referee  for  pointing  out  that  Dot)  Directive  5000  28  on  Design  to  Cost  is  undergoing  revision  It  is  possi¬ 
ble  that  the  research  reported  here  or  future  research  of  the  type  suggested  in  this  section  will  be  of  use  in  formulating  these  and 
other  government  acquisition  policies 
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Fioimt  1.  Areas  of  Eirm  Atcepiahility 

the  .subset  of  the  industry  participating  in  the  acquisition  process  and  possibly  in  the  long-term,  the 
structure  of  the  defense  industry  as  a  whole.  In  order  to  model  the  interaction  between  government 
policy  and  industry  structure,  we  must  determine  three  characteristics  of  the  firms  in  the  industry.  The 
first  ch.iracteristic,  as  mentioned  in  the  previous  section,  is  the  spectrum  of  alternative  earnings  oppor¬ 
tunities  of  the  firms.  The  availability  of  Ihese  opportunities  may  lead  some  firms  to  withdraw  from  the 
development  stage  of  the  acquisition  process.  In  contrast,  the  opportunities  may  lead  other  firms  to 
participate  in  the  hope  that  resources  acquired  during  the  project  (skilled  workers,  experienced 
managers,  etc.)  may  be  useful  in  other  areas,  thus  increasing  the  follow-on  benefits  of  the  project.  The 
second  characteristic  is  the  range  of  risks  that  these  opportunities  present  to  the  firms,  which  may  lead 
some  firms  to  decline  to  participate  in  (he  development  phase  of  the  project.  The  third  is  the  firms' 
perceptions  of  future  government  actions  (such  as  future  projects,  design-to-cost  goals,  and  incentive 
parameters),  and  especially,  of  the  uncertainty  associated  with  these  actions.  It  has  been  posited  that  a 
major  blocking  factor  in  industrial  innovation  is  industry  perception  of  (he  uncertainty  in  future  govern¬ 
ment  regulations  and  specifically  that  "the  uncertainty  of  federal  requirements,  rather  than  their 
stringency,  was  perceived  as  the  most  important  blocking  factor,"*  Thus,  government  parameter  setting 
behavior,  established  during  a  sequence  of  projects,  may  induce  perceptions  and  uncertainties  about 
future  government  actions  that  will  influence  significantly  the  attractiveness  of  defense  contracting  to 
individual  firms  and  thereby  the  structure  of  the  defense  industry. 


'Sec  Vlyorv  anil  .S«ee/i  page  29.  for  ihc  (junlalitm. 


VOL  29.  NO  I.  march  1982 


naval  research  logistics  QUARTiRLY 


18 


R  W  BLANNING,  P  R  KLEINDORFER  AND  C  S  SANKAR 


ACKNOWLEDGMENT 

The  authors  gratefully  acknowledge  the  helpful  comments  of  Marvin  Denicoff  ol  the  Office  of 
Naval  Research  (ONR)  on  an  earlier  draft  of  this  paper.  This  research  was  carried  out  under  Contract 
N00014-77-C-0171  of  ONR. 


REFERENCES 

111  "Application  of  Design-to-Cost  Concept  to  Major  Weapon  Systems  Acquisitions,"  Report  to  the 
Congress  by  the  Comptroller  General  of  the  United  States,  U.S.  General  Accounting  Office, 
Washington,  D.C.,  June  23,  1975. 

[2]  Bonin,  J.P.,  "On  the  Design  of  Managerial  Incentive  Structures  in  a  Decentralized  Planning 

Environment,"  The  American  Economic  Review,  66,  4,  682-687  (September  1976). 

[3]  Canes,  M.E.,  "The  Simple  Economics  of  Inventive  Contracting;  Note."  The  American  Economic 

Review,  65,  3,  478-483  (June  1975). 

[4]  Cummins,  M.T.,  "Incentive  Contracting  for  National  Defense:  A  Problem  of  Optimal  Risk  Shar¬ 

ing,"  The  Bell  Journal  of  Economics,  8,  1,  168-185  (Spring  1977). 

[5]  Defense  Management  Journal,  "Design  to  Cost,"  Special  Issue,  70,  4,  1-40  (September  1974). 

[6]  Groves,  T.,  "Incentives  in  Teams,”  Econometrica,  41  4,  617-631  (1973). 

[7]  Groves,  T.,  "Information  Incentives  and  the  Internalization  of  Production  Externalities,"  in  Theory 

and  Measurement  of  Economic  Externalities,  S.  Lin,  Editor  (The  Academic  Press,  New  York, 
1976). 

[8]  Harris,  M.  and  A.  Raiv,  "Some  Results  on  Incentive  Contracts  with  Applications  to  Education  and 

Employment,  Health  Insurance,  and  Law  Enforcement,"  The  American  Economic  Review,  68, 
1,  20-30  (March  1978), 

[9]  Jennergren,  P.L.,  "On  the  Design  of  Incentives  in  Business  Firms— A  Survey  of  Some  Research," 

Management  Science,  26  2,  180-201  (1980). 

[lOl  Kleindorfer,  P.R.  and  M.R.,  Sertel,  "Profit-Maximizing  Design  of  Enterprises  through  Incentives," 
Journal  of  Economic  Theory,  20,  3,  318-339  (June  1979). 

(Ill  McCall,  J.J.,  "The  Simple  Economics  of  Incentive  Contracting,"  The  American  Economic  Review, 
60,  5,  837-846  (December  1970). 

[12]  McCullough,  D.J.,  "Design  to  Cost  Problem  Definition,  Survey  of  Potential  Actors  and  Observa¬ 
tions  on  Limitations,"  Paper  P-928,  Institute  of  Defense  Analysis,  Washington,  D.C.,  January 
1973. 

[131  Myers,  S.  and  E.E.  Sweezy,  "Why  Innovations  Falter  and  Fail;  A  Study  of  200  Cases,”  Report  R 
75-04,  Denver  Research  Institute,  University  of  Denver,  January,  1976. 

[14]  Office  of  Management  and  Budget.  Circular  A-109,  April,  1976. 

[15]  Ross,  S.A.,  "On  the  Economic  Theory  of  Agency  and  the  Principle  of  Similarity,"  in  Essays  on 

Economic  Behavior  under  Uncertainty  M.  Balch,  D.  McFadden,  and  S.  Wu,  Editors  (North  Hol¬ 
land,  Amsterdam,  1974). 

[16]  Tashjian,  M.T.,  "Implementation  of  the  Design-to-Cost  Concept  from  the  Contractual  Point  of 

View,"  Defense  Management  Journal,  10,  4,  8-17  (September  1974). 


NAVAL  RE.SFARCH  LOOLSTICS  QUARTERLY 


VOL.  29.  NO.  1,  MARCH  1982 


OPTIMAL  STRATEGIES  IN  A  GAME  OF  ECONOMIC  SURVIVAL 

Karl  Borch 

Norwegian  School  of  Business 
Bergen-Sandviken,  Norway 

ABSTRACT 

The  games  of  economic  survival  introduced  by  Shubik  and  Thompson  seem 
tailor-made  for  the  analysis  of  some  problems  in  insurance  and  have  found 
many  applications  in  this  industry.  The  optimal  strategy  in  such  games  may  be 
a  so-called  "band  strategy."  This  result  seems  counter-intuitive  and  has  caused 
some  puzzlement  This  paper  gives  sufficient  conditions  so  that  the  optimal 
strategy  will  be  of  a  simpler  form,  and  it  is  argued  that  these  conditions  are 
satisfied  in  most  applications  to  insurance. 


INTRODUCTION 

The  term  "economic  survival  game"  is  really  another  name  for  the  classic  problem  of  the 
gambler’s  ruin  which  can  be  traced  back  to  Pascal.  The  new  term  is  usually  associated  with  Shubik  and 
Thompson  [11],  who  extended  some  of  the  classic  results  to  games  of  strategy.  This  generalization  in 
itself  may  not  have  been  particularly  fruitful,  but  it  led  to  an  increased  interest  in  the  multi-period 
one-person  game  against  nature.  It  is  the  starting  point  of  the  papers  by  Miyasawa  [7]  and  Morrill  [9], 
and  the  general  ideas  are  behind  the  book  by  Dubins  and  Savage  [4].  A  few  years  earlier  De  Fineit'  [3] 
presented  a  similar  model,  as  a  generalization  of  the  older  actuarial  theories,  based  on  the  probability 
that  an  insurance  company  shall  become  insolvent  or  "ruined." 

Miyasawa  and  Morrill  both  show  that  the  optimal  strategy  can  be  a  "band  strategy"— a  concept  to 
be  explained  in  the  next  section.  Strategies  of  this  form  are  not  intuitively  appealing  and  do  not  seem 
to  agree  well  with  observations  when  the  model  is  applied  to  situations  in  real  life,  but  they  have  been 
studied  for  their  mathematical  interest  by  several  authors  including  Hallin  [6].  The  purpose  of  this  note 
is  to  show  that  a  band  strategy  can  be  optimal  only  in  fairly  exceptional  cases. 

The  presentation  is  given  in  terms  of  insurance,  which  was  the  application  Morrill  [8]  originally 
had  in  mind.  This  interpretation  was  also  the  starting  point  of  De  Finetti,  and  his  models  have  been 
developed  and  discussed  extensively  in  actuarial  literature,  by,  among  others,  Borch  [1],  Buhlmann  |2l 
and  Pentikainen  [10].  It  is,  of  course,  possible  to  give  other  interpretations  to  the  model. 

THE  PROBLEM 

De  Finetti’s  starting  point  is  an  insurance  company  which  underwrites  identical  portfolios  in  each 
consecutive  operating  period.  Let  the  portfolio  be  characterized  by: 

P  =  premiums  received  in  an  operating  period. 

X  =  claims  paid  during  the  period,  a  stochastic  variable  with  the  distribution  f  (.v). 


J 


VOL.  29,  NO  I,  MARCH  1982 


19 


NAVAL  RESHARCH  KKilSTlCS  O'  ARII  Rl  N 


20 


K.  BORCH 


Lei  S,  be  the  company’s  equity  capital  or  "surplus"  at  the  end  of  period  r.  Clearly,  .S,  follou-,  a  random 
walk  defined  by; 

S,+  |  =  5,  +  P  -  X. 

In  insurance  it  is  natural  to  assume  that  if  S,  <  0  the  company  is  ruined  and  is  noi  allowed  to 
operate  in  the  following  periods. 

At  the  end  of  period  f,  the  company  may  consider  paying  a  dividend,  s,  6  (0.  .S',).  I  he  j>roblem  is 
then  to  find  the  optimal  dividend  policy.  De  Finetti  studied  different  assumptions  about  the  compan>  s 
objectives.  One  of  these  was  that  the  company  would  seek  to  maximize  the  expected  discounted  sum 
of  the  dividend  payments: 

{ 1 )  £■  j;  v's, 

^-0 

where  v  €  (0.1)  is  the  discount  factor. 

Let  F(S)  denote  the  maximum,  i.e.,  the  expected  discounted  sum  of  the  dividend  payments  it 
the  initial  equity  capital  is  S  and  if  the  company  follows  an  optimal  dividend  policy.  It  is  easy  to  sec 
that  L(5)  must  satisfy  the  functional  equation: 

<2)  L(S)=  max  s  +  v  f  ViS  -  s  +  p  -  x)dF(x)\. 

Essentially,  early  dividend  payments  are  preferred,  but  a  high  dividend  implies  a  low  retained 
surplus  and  hence  a  high  probability  of  ruin,  i.e.,  that  the  payments  will  terminate.  One  would  there¬ 
fore  expect  that  an  optimal  s  exists  for  any  value  of  S. 

Equation  (2)  can  be  solved  by  standard  methods  of  dynamic  programming,  and  Miyasawa  and 
Morrill  have  shown  that  the  optimal  dividend  policy  is  of  the  form: 


o 

It 

for 

5  <  z, 

s  -  5  ~ 

for 

Zi  <  5  <  Z 

5  =  0 

for 

Zj  <  5  <  2 

II 

1 

^2n+l 

for 

^2n+t  < 

A  policy  of  this  form  defined  by  a  set  of  2n  +  1  numbers,  is  called  a  "band  strategy"  by  Morrill 
The  special  case  with  n  =  0  he  calls  a  "barrier  strategy." 

A  band  strategy  implies  that  a  company  may  omit  or  reduce  its  dividend  after  a  successful  operat¬ 
ing  period,  and  this  seems  to  be  contradicted  by  the  business  behavior  one  can  obser\c.  The  harrier 
strategy  does  not  invite  the  same  objections.  Essentially,  it  implies  that  the  company  decides  that  it  is 
optimal  to  maintain  a  certain  equity  capital  Z.  No  dividend  is  paid  if  the  equity  capital  is  below  this 
level.  If  the  equity  capital  exceeds  Z,  the  excess  is  paid  out  as  dividend  immediately.  This  seems  rea¬ 
sonable  and  suggests  that,  in  normal  cases,  a  barrier  strategy  may  be  optimal. 

Assume  now  that  the  company  decides  to  adopt  a  barrier  strategy  defined  by  a  number  /,  not 
necessarily  optimal.  The  expected  discounted  value  of  the  dividend  payments  under  this  policy  is 
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(3)  K(5.  Z)  =  V  K(S  +  P  -  X  Z)  dF(x). 

-J  0 

If  a  density,  f(x)  =  F'(x),  exists,  this  is  an  integral  equation  with  the  boundary  conditions 
1^(5,  Z)  =  0  for  5  <  0  and  ^  (5.  Z)  =  S  -  Z  +  K(Z,  Z)  for  S  ^  Z. 


The  equation  has  a  unique  continuous  solution  for  any  nonnegative  value  of  Z.  Morrill's  result 
implies  that  the  solutions  in  Z  to  the  equation 

Z) 


(4) 


BZ 


=  0 


are  independent  of  5.  A  barrier  strategy  will  be  optimal  only  if  (4)  has  a  unique  root  in  Z. 

A  SOLUTION  OF  THE  PROBLEM 


The  integral  equation  (3)  can  be  solved,  and  it  is  possible  to  study  the  roots  of  (4)  by  this  direct 
approach.  It  seems,  however,  more  convenient  to  take  a  roundabout  way  and  consider  a  rather  naive 
dividend  policy.  Assume  that  at  the  end  of  a  profitable  operating  period  the  company  pays  the  whole 
profit  out  as  dividend  immediately.  If  the  period  has  given  a  loss,  no  dividend  is  paid. 

Let  11^(5)  be  the  expected  discounted  sum  of  the  dividends  paid  under  this  policy  when  the 
company’s  initial  capital  is  S:  The  function  IP (5)  is  clearly  a  solution  of  the  integral  equation: 
rP  rP  rS+P 

(5)  IP(S)  =  (P-A-)/(x)r&  +  vIP(S)  /(x)(ic  +  v  IP(S  +  P  -  .v) /(x  )<& 

which  evidently  is  far  simpler  than  the  functional  equation  (2).  Setting  the  first  term  on  the  right-hand 
side  in  (5)  equal  to  C,  and  rearranging  the  other  terms,  the  equation  takes  the  form 

(6)  {l-vP(P)lIP(S)  =  C  +  \  ^V(S-x) /(x  +  P)dx. 


Taking  the  Laplace  transform  of  (6)  one  finds: 


ll-vP(P)l  IP(S)e-'^rf5 


r 

1  (  r  °° 

V  1  W 

1  In  +  P)e  "dx 

•'0 

1 

Writing 

1  -  vP(P)  =  A 


and 

f  fix  +  P)e~‘^dx  = 
0 


the  equation  takes  the  form 


(7) 


W ix)e  “dbr  = 


C 

K  -  vdtii)  ' 


The  left-hand  side  of  (7)  can  be  written 

I  r°°  iVix)e-'^dx=  WiO)+  f”  WMe-'^dx. 

•To  Jo 

As  v</)(r)  <  <6(0)  =  1  —  FiP)  <  1  —  vP(P)  =  for  V  <  1,  it  follows  that  the  right-hand  side  of 
(7)  can  be  expanded  in  a  convergent  series.  Hence: 
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(8) 


r“  py'(x)e-‘^dx  +  M'(O)  =  S  T  (t>(l) 

n-O  ^ 


By  taking  S  =  0  in  (6)  one  sees  that  W'(O)  =  -p,so  that  (8)  can  be  written 

A 


(9) 


J”  W  Me-’^dx  =  W'W)  £ 


/r*  I 


^«(r) 


(<t>U))"  is  the  Laplace  transform  o|  the  «th  convolution  of  fix+p)  with  itself,  which  will  be  denoted 
by  f„(x).  Hence,  =  Jj,  P- y)dy  for  n  >  1,  and  let  by  definition 

/i(.v)  =  /(x  +  P). 


Taking  the  inverse  transform  of  (9),  one  obtains: 


(10) 


W(x)  =  M^(0)  £ 

/I-  1 


/„(x). 


From  (10)  IV (x)  itself  can  be  found  by  integration: 


^V(x)  =  (F(0)  £ 

fl— 0 


Fn(x). 


This  result  can  be  verified  directly.  To  show  this,  introduce  a  distribution  G(x)  defined  as  fol¬ 
lows: 

G(0)  =  F(P) 

G(x)  =  <  x  +  P)  for  X  >  0. 


Substitution  of  this  in  the  calculations  above  gives 
Wix)  =  C(14-  vG(x)  v^Gjfx)  ...I. 


G„(x)  is  the  probability  that  accumulated  losses  in  n  periods  shall  not  exceed  the  initial  equity 
capital  X.  If  this  event  occurs,  the  expected  discounted  value  of  the  next  dividend  payment  is  v'T. 

The  convolutions  G„(x)  have  a  complicated  form,  and  this  symbol  will  not  be  used  in  the  follow¬ 
ing. 


The  expansion  (10)  is,  as  noted,  convergent,  and  IV (x)  as  a  discounted  sum  of  distribution  func¬ 
tions  is  a  bounded  and  nondecreasing  function  of  x.  From  (6)  it  follows  that 


IV(0)  = 


_ C _ 

\-\r(p)  ■ 


Further,  it  is  easy  to  see  without  any  calculation,  that 


fV(oo) 


c 

1- V 


=  ^v'J^  iP  -  x)f{x)dx. 


The  equation  says  that  if  the  equity  capital  is  infinite,  the  company  will  never  be  ruined,  and  the 
expected  dividend  will  be  equal  to  expected  profit  in  all  future  periods. 
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From  (10)  it  follows  that 

)F'(0)  =  IF(O)-^  /(/») 
and 

fF'(oo)  =  0. 

A  SUFFICIENT  CONDITION  THAT  A  BARRIER  STRATEGY  IS  OPTIMAL 

Consider  now  the  equation 
(11)  B"(Z)  =  1. 

It  can  be  shown  that  the  solutions  of  (4)  and  (11)  coincide.  This  is  a  tedious  procedure,  and  one 
can  arrive  at  the  result  by  an  indirect  approach. 

If  Z  is  a  solution  of  (11)  and  if  the  company’s  capital  at  the  end  of  an  operating  period  is  Z  +  r 
with  c  >  0,  we  will  have,  at  least  for  z,  sufficiently  small 

W(Z  +  z)  <  W(Z)  +  z. 


Hence,  the  expected  discounted  sum  of  the  dividends  paid  will  increase  if  the  company  departs 
from  its  naive  policy  and  immediately  pays  the  amount  z  out  as  dividend. 

Similarly,  if  the  capital  is  smaller  than  Z,  it  will  be  optimal  to  retain  profits  until  the  capital  has 
been  brought  up  to  Z  before  paying  any  dividend. 

A  sufficient  condition  that  equation  (II)  has  a  unique  root  is  IF'(O)  >  1  and  iV"(x)  <  0. 

Differentiation  of  (10)  gives 

<x> 

From  the  definition  of  the  convolution,  it  follows  that: 

,/„(x)=J'^  ,/„_,(v)/(x +  F-y)c/v  for  n  ^  2. 

Differentiation  of  this  expression  gives: 

/;(x)  =  f„^,{y)f(x  +  P-y)  dy 

and  by  definition 

f\(x)  =  f'(x  +  P). 


(12)  H/''(x)  =  1F(0)  2^ 

n-\ 


Substitution  of  these  expressions  ini6  (12)  gives 

lF"(x)  =  W(Q)  ^  \  f'(x  +  P)  +  f{P),r{x  +  P) 

A  I  A 


+  M"(o)  y  ~ 
K 


n  / 


x  +  i  +  -jr  /(P)/„(x)  . 
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The  first  term  in  (13)  is  negative  if 

(14)  f'(x  +  P)  +  ^  f(P)f(x  +  P)  <  (i. 

K 


Substitution  of 

,/'(.v  +  P->)  =  -^f(P)f(x  +  P-y) 

K 

in  the  following  terms,  shows  that  they  all  vanish.  Hence,  (14)  is  a  sufficient  condition  that 
14'"(.v)  <  0, 


Condition  (14)  can  be  written 
fix  +  P)  ^ 


ftx  +  P) 


K 


f(P) 


and  this  gives 


f(x  +  P)  <  f{P)e  f(P)x. 


Hence,  equation  (11)  will  have  a  unique  root  if  the  claim  density  goes  to  zero  at  least  as  rapidly  as 
an  exponential. 

A  necessary,  but  not  sufficient  condition  is  that /'(jc  + /*)  <  0  for  .v  >  0.  For  a  unimodal  distri¬ 
bution  this  condition  will  be  satisfied  when  P  is  greater  than  the  mode.  The  typical  claim  distribution  in 
insurance  will  be  skew,  with  a  long  tail.  For  such  distributions  the  mean  will  be  greater  than  the  mode, 
and  the  premium  P  will  again  normally  be  greater  than  the  mean,  i.e.,  than  expected  claim  payments. 

If  the  necessary  condition  is  satisfied,  the  sufficient  condition  (14)  appears  as  a  mild  regularity 
condition  for  the  tail  of  the  distribution,  and  the  curious  cases  found  by  Miyasawa  and  Morrill  may  be 
dismissed  as  interesting  but  not  very  relevant. 

Clearly,  the  company  will  not  find  it  optimal  to  maintain  positive  reserves  unless 
fT'(O)  >  1. 


.Substitution  of  the  expressions  derived  from  (10)  gives  the  condition  in  the  form: 

\t  iP)  C  iP-x)  fix)dx  >  (1- vF(/’))T 

n 


Since  the  left-hand  side  will  go  to  zero  with  P,  it  follows  that  the  premium  must  be  above  a  cer¬ 
tain  level  to  induce  the  company  to  risk  any  part  of  its  equity  capital  as  reserves  in  the  underwriting 
business.  The  company  will,  however,  not  refuse  to  underwrite  any  portfolio,  since  it  cannot  lose 
unless  it  put  some  of  its  own  money  at  risk. 


SPK(  lAT  CASE 

As  a  simple  special  case  assume  that  the  claim  density  is  exponential,  i.e., 
/(x)  =  e'. 
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Substitution  in  the  general  formula  gives: 


c 

-p 

(1- v)x 

1- V 

1  —  v  +  ^ 

1  —  V  +  ve’’ 

and 


W’(x) 


t\e 
(1  —  v+ 


exp 


(1- v)x 
1  —  v  + 


It  is  easy  to  see  that  H^"(x)  <  0,  and  hence  that  a  barrier  strategy  is  optimal. 

Equation  (11)  which  determines  the  optimal  strategy  takes  the  form 

- — ^ ^  „  Z  =  lnC  +  lnv-/*— 2  1n(l  -  v+  vc’'’), 

1  —  V  +  VC  ^ 

or  by  substituting  the  expression  for  C  and  writing  v  =  (1  + 

„  =  ln(/>-  1+c-^)  -  2  In  (1  +  /)  -  P  -  2  In 


The  table  below  gives  the  value  of  Z  for  some  selected  values  of  P  when  i  =  0.1 . 


Premium  P 

Optical  Reserve  ==  Z 

Ruin  probability 

1.0 

Negative 

— 

1.2 

Negative 

— 

1.3 

0.166 

0.23 

1.4 

1.052 

0.086 

1.5 

1.42 

0.054 

2.0 

2.28 

0.014 

3.0 

2.14 

0.0059 

4.0 

1.47 

0.0042 

7.0 

0 

0.0009 

The  table  shows  that  increasing  P  will  increase  the  quality  of  the  insurance  contracts,  i.e.,  reduce 
the  probability  of  ruin.  It  should  not  be  surprising  that  Z  will  decrease  with  increasing  P  for  large 
values  of  P.  If  the  level  of  premiums  is  very  high,  there  will  be  a  very  low  probability  that  the 
underwriting  shall  lead  to  a  loss,  and  the  incentive  to  maintain  additional  reserves  will  be  reduced. 

RELEVANCE  TO  INSURANCE 

The  barrier  strategy  may  be  less  objectionable  than  the  band  strategy,  but  it  still  does  not  agree 
very  well  with  the  observations  one  can  make  of  dividend  policies  in  real  life.  If  an  insurance  company 
has  had  an  exceptionally  profitable  period,  it  will  usually  pay  out  only  a  part  of  the  profits  as  dividend. 
Sometimes  this  is  justified  by  an  explicit  statement  to  the  effect  that  it  is  desirable  to  retain  some  profits 
in  order  to  safeguard  future  dividend  payments.  This  clearly  contradicts  the  assumption  that  the 
company’s  objective  is  to  maximize  (1). 

Even  if  the  company  and  its  shareholders  seem  to  agree  that  the  objective  should  be  to  maximize 
an  expression  of  the  form  (1),  one  may  observe  apparent  departures  from  the  barrier  strategy.  This 
may  occur  just  because  the  model  is  too  simple  to  give  an  adequate  representation  of  the  complex  real 
world,  but  a  number  of  other  possible  explanations  suggest  themselves,  among  other  things: 
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(i)  It  may  be  considered  fair  that  the  insured,  i.e.,  the  policyholders,  should  receive  some 
benefits  after  a  very  profitable  period.  One  way  of  arranging  this  would  be  to  increase  the  company's 
reserves,  and  thus  reduce  the  probability  of  ruin  in  the  next  period,  i.e.,  improve  the  security  of  the 
policy  holders.  In  some  countries  the  governments  seem  to  try  to  induce  insurance  companies  to  lake 
this  attitude  and  maintain  a  conservative  dividend  policy. 

(ii)  The  managers  of  an  insurance  company  may  rightly  or  wrongly  believe  that  they  will  be 
blamed  for  sharp  reductions  in  dividend  payments.  Their  natural  reaction  may  then  be  to  argue  in 
favor  of  a  conservative  dividend  policy  which,  incidentally,  will  increase  the  expected  life  of  the  com¬ 
pany  and,  hence,  the  job  security  of  managers  and  other  employees. 

One  will  get  a  smoother  sequence  of  dividend  payments  by  assuming  that  the  company’s  objective 
is  to  maximize 

(IS)  Xv'-ls,) 

l-U 

where  u(s)  is  a  concave  utility  function. 

This  generalization  of  De  Finetti's  model  has  been  studied  by  several  authors,  by,  among  others, 
Hakansson  [5],  who  finds  that,  for  a  particular  class  of  utility  functions,  the  optimal  strategy  is  of  the 
form  s  =  ^S.  This  means  that  at  the  end  of  each  operating  period  the  company  pays  out  a  fraction,  9. 
of  its  equity  capital  as  dividend,  and  represents  behavior  in  conformance  with  observations.  The  objec¬ 
tions  to  criterion  function  (15)  are  essentially  of  a  theoretical  nature.  If  the  company  should  reduce  the 
scale  of  its  operations,  or  stop  selling  insurance  altogether,  it  would  find  it  optimal  to  pay  out  its  equity 
capital  in  an  infinite  decreasing  sequence.  This  might  make  sense  in  the  consumption  plans  discussed 
by  Hakansson  but  not  for  a  dividend  policy. 

The  discount  factor  v  which  appears  in  the  formula,  should  be  based  on  an  interest  rate  which 
represents  a  pure  risk  premium.  The  company's  reserves  can  be  invested  to  earn  a  return,  which  could 
be  paid  directly  to  the  shareholders  as  owners  of  the  capital.  Theoretically,  they  could  even  keep  the 
capital,  provided  that  they  accept  liability  for  possible  losses  in  the  insurance  operations.  Government 
regulations  will  usually  require  that  the  reserves  should  be  kept  in  low-risk  assets,  which  also  must  be 
of  fairly  liquid  nature,  since  the  reserves  can  be  called  upon  at  short  notice  to  pay  claims.  This  means 
that  the  rate  of  return  on  the  reserves  will  be  modest,  probably  close  to  the  risk-free  market  rate  of 
interest.  The  shareholders  risk  their  capital  as  reserves  in  the  insurance  company  and  will  require  a 
higher  expected  return.  This  interest  differential  seems  to  be  the  one  to  use  for  determining  the 
discount  factor. 
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ABSTRACT 

The  objective  of  a  diagnostic  analysis  is  to  provide  a  measure  of  perfor¬ 
mance  of  an  existing  system  and  estimate  the  benefits  of  implementing  a  new 
one.  if  necessary.  Firms  expect  diagnostic  studies  to  be  done  promptly  and 
inexpensively.  Consequently,  collection  and  manipulation  of  large  quantities  of 
data  are  prohibitive.  In  this  paper  we  explore  aggregate  optimization  models  as 
tools  for  diagnostic  analysis  of  inventory  systems.  We  concentrate  on  the 
dynamic  lot  size  problem  with  a  family  of  items  sharing  the  same  setup,  and  on 
the  management  of  perishable  items.  We  provide  upper  and  lower  bounds  on 
the  total  cost  to  be  expected  from  the  implementation  of  appropriate  systems 
Howeier.  the  major  thrust  of  the  paper  is  to  illustrate  an  approach  to  analyze 
inventory  systems  that  could  be  expanded  to  cover  a  wide  variety  of  applica¬ 
tions  A  fundamental  by-product  of  the  proposed  diagnostic  methodology  is  to 
identify  the  characteristics  that  items  should  share  to  be  aggregated  into  a  single 
family. 


1.  INTRODICTION 

InvcrUory-control  theory  provides  a  wide  variety  of  models  to  manage  effectively  products  with 
different  chiiracteristics.  Extensive  surveys  are  available  that  culminate  in  a  taxonomy  identifying 
specific  decision  rules  to  manage  inventories  under  a  number  of  conditions.  (Silver  [71,  Nahmias  [.*'1. 
and  .Aggarwal  (I).)  A  very  legitimate  concern  of  many  managers  today  is  to  understand  the  extent  to 
which  these  models  could  contribute  to  the  enhancement  of  the  performance  of  their  current  inventory 
system  C  onsequently,  management-science  practitioners  are  frequently  faced  with  requests  to  estimate 
the  benefits  for  improving  the  performance  of  such  systems.  Firms  expect  that  the  estimation  process 
in  iiself  will  not  represent  a  major  project  and  consume  a  considerable  amount  of  human  and  financial 
resources;  the  analysis  must  be  reasonably  accurate  and  inexpensive. 

Two  basic  approaches  have  emerged  to  comply  with  those  requirements.  One,  the  statistical 
approach,  is  based  on  analyzing  the  inventory  performance  on  a  relatively  small  sample  of  items  and 
generating  inferences  that  can  be  applied  to  the  whole  population.  The  other  approach,  based  on 
optimization  techniques,  consists  in  devising  simple  models  to  estimate  the  benefits  of  implementing 
the  proposed  system. 

The  simplified  models  are  derived  by  aggregating  items  into  families,  thereby  reducing  the  data 
collecTion  and  computation  efforts.  The  ultimate  objective  is  the  generation  of  bounds  relating  the 
aggregated  model  to  the  original  problem.  Zipkin  [12]  and  Evans  [3]  have  developed  such  bounds  for 
linear  and  transportation  models,  respectively. 
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The  management-science  literature  has  rarely  addressed  the  identification  of  problems  and  the  a- 
prinn  measurement  of  benefits  to  be  derived  from  the  implementation  of  a  proposed  new  system. 
Recently  Hax.  Majiuf  and  Pendrock  [4],  and  Wagner  [9]  have  stressed  the  importance  of  developing 
diagnostic  analysis  tools.  In  particular  Hax,  Majiuf  and  Pendrock  [4]  report  the  results  of  a  diagnostic 
analy  sis  of  a  large  logistics  system. 

In  [21  we  illustrated  how  statistically-based  techniques,  like  clustering,  inference  and  sampling, 
may  be  used  in  diagnostic  studies.  In  this  paper  we  focus  on  the  application  of  aggregate-optimization 
models.  In  Section  2  we  concentrate  on  the  dynamic-lot-size  problem  with  a  family  of  items  sharing 
the  same  setup.  We  examine  an  aggregation  scheme  and  compute  upper  and  lower  bounds  on  the  total 
cost  to  be  expected  from  the  implementation  of  appropriate  systems.  We  also  provide  conditions  under 
which  the  solution  to  the  aggregate  problem  solves  the  original  problem.  In  Section  three  we  analyze 
the  management  of  perishable  items.  An  aggregate  version  of  the  newsboy  problem  is  studied  and 
bounds  are  determined.  Conclusions  and  topics  for  future  research  are  discussed  in  the  last  section. 

We  would  like  to  emphasize  that  although  this  paper  examines  two  classes  of  inventory  models, 
our  aim  is  to  propose  a  general  approach  to  design  diagnostic  methodologies  that  could  be  easily 
extendable  to  a  wide  spectrum  of  optimization  models.  Moreover,  we  hope  that  the  result  of  our 
analysis  not  only  contributes  to  the  development  of  quantitative  measures  to  evaluate  inventory  perfor¬ 
mance,  but  also  provides  insights  into  the  conditions  that  have  to  be  met  by  the  items  to  become  candi¬ 
dates  for  aggregation  into  a  family  unit.  This  last  result  could  contribute  to  the  important  subject  of 
aggregate  planning. 

2.  I.OT  SIZE  PROBLEMS:  A  DIAGNOSTIC  ANALYSIS 

In  this  section  we  develop  diagnostic  procedures  to  estimate  ordering  and  inventory  carrying  costs 
associated  with  the  operation  and  management  of  a  family  of  items  replenished  in  lots.  The  objective 
of  this  procedure  is  to  obtain  estimates  of  total  costs  while  reducing  as  much  as  possible  the  massive 
manipulation  of  data  and  the  computational  requirements.  We  derive  error  bounds  for  aggregate  prob¬ 
lems  and  rules  indicating  how  to  aggregate  the  items  in  a  family  to  minimize  such  errors. 

We  address  practical  situations  in  which  all  items  in  a  family  are  replenished  simultaneously  and 
share  the  same  ordering  costs. 

Whenever  the  items  in  the  family  satisfy  the  conditions  underlying  the  economic  order  quantity 
system  [6l,  the  total  annual  cost  of  ordering  and  holding  the  family  in  stock  is  easily  shown  to  be 

(2.1)  7'r  =  s/25> 

where  .S',  r,  D,  and  v,  are,  respectively,  the  ordering  costs,  the  inventory  carrying  factor,  the  annual 
demand  of  item  /,  and  the  unit  cost  of  item  i.  The  cost  of  holding  in  stock  one  unit  of  product  i  for 
one  year  is  rv,.  Therefore,  (2.1)  can  be  rewritten  as 

TC  =  (production  cost  of  family) -JlSr. 

In  this  case  total  ordering  and  holding  cost  for  the  family  of  items  can  be  readily  computed  from 
aggregate  data.  However,  when  the  assumptions  underlying  the  economic  order  quantity  system  differ 
significantly  from  the  actual  conditions,  the  estimation  of  the  costs  must  be  based  on  more  complex 
models. 

In  what  follows  we  assume  that  the  demand  for  each  item  in  the  family,  in  each  period  1,2,  ...  T, 
IS  deterministic.  The  inventory  control  problem  then,  is  to  solve  the  following  mixed  integer  program¬ 
ming  problem: 
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T 

N 

N  N  1 

(P) 

zp  =  min  fix)  =  ^ 

/=i 

S,8 

»-l 

1-1  /-I  T»I 

subject  to 


£  (x,.,  -  c/,,)  ^  0  /  =  1.2 . T-,i=\,2,  ....  N. 

r-l 

X,  ^  ^  0  T  =  1,2,  . . .  ,  T  \  i  =  1.2 . /V. 

*.  if  X  ^  ® 

(-1 

0.  if  £  X,,  <  0 
y-l 

/  =  1.2 . /■;  /•  =  1,2 . N. 


x,, 


where  p, ,,  4,c  arid  are,  respectively,  the  unit  cost,  holding  cost,  demand  and  amount  ordered 

for  item  /  in  period  r,  and  the  ordering  cost  in  period  i.  Without  loss  of  generality,  we  assume  that  the 
initial  inventory  is  zero. 


We  will  compare  problem  (P)  with  the  solution  obtained  from  the  following  aggregate  problem: 


{AP) 
subject  to 


Zap  =  min  'P(y)  =  £ 


f-i 


5,80',)  +  P,yi  +  h, 


£  0,  -  >  0  r  =  1,2 . T 

r-l 


y,  ^  0  T  =  1.2 . T 


8  0',) 


1.  ify,  >  0 
0,  if  y,  <  0 


f  =  1,2 . T 


where  /?,,  h,,  d,,  y,  and  S,  are  the  unit  cost,  holding  cost,  the  demand  and  amount  ordered  for  the  fam¬ 
ily  in  period  i,  and  the  ordering  cost  in  period  r.  We  assume  that  y,  and  d,  are  defined  as  follows  for 
conveniently  selected  constants  )c,  >  0,  i  =  1,2 . yV; 


,v 


(2.2) 

y,  =  L 

y  =  1,2,  .. 

.  .  T.  y  =  1.2.  .. 

.  .  N. 

(2.3) 

y  =  1,2,  .. 

.  .  r,  y  =  1.2,  .. 

.  .  N. 

Therefore,  if  x,,  /  =  1,2 . N-,  i  =  1.2 . T  is  feasible  in  (P),  y,  /  =  1, 2 . T  defined 

as  in  (2.2),  is  feasible  in  (AP).  Wagner  and  Whitin  (10)  provide  an  efficient  dynamic  programming 
algorithm  to  solve  (P)  and  (AP). 
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The  expression  below,  for  a  pair  (x.  y)  satisfying  (2,2)  and  (2.3),  is  useful  in  our  development. 


(2,4) 


,l(x)  -  ^(y)  =  ^  S, 


r-l 


t. 


+  I 


L  f'u  Z  ~  ~  +  51  ~  P'^' 

/-I  7=\  r=I  /=1 


T  S  i  T  ^ 


“  Z  Z  51  ^'''g  ~  ,  -  d,  ,)  +  51  Z  -  k,p,). 

/=U=1  7=1  /=1  /=1 

The  second  equality  follows  from  (2.2),  (2.3)  and  the  fact  that 


s 


=  8  (y, )  r  =  1 , 2 .  T. 


PROPOSITION  2.1  (Theorem  6  in  [81):  iP)  and  (AP)  have  at  least  one  optimal  solution  of  the 


form 

(2.5) 

£  (.v,,,-4..)-v,,,.,  =  0  r=  1,2 . 7-;/=  1,2,  .. 

. ,  ^ 

(2.6) 

z  -  0  /  =  1.2 . T. 

r»l 

Note  that  (2.5)  and  (2.6)  characterize  the  extreme  points  in  (P)  and  iAP).  respectively.  The 
proposition  indicates  that  in  order  to  compare  zp  and  it  is  sufficient  to  consider  feasible  solutions  of 
(P)  and  iAP)  such  that  in  each  period,  either  the  initial  inventory  is  zero,  and  eventually  an  order  is 
placed,  or  the  initial  inventory  is  positive  and  no  order  is  placed. 

PROPOSITION  2.2:  Let  .v,  t  =  1,2 . T  be  feasible  in  (AP)  and  assume  it  satisfies  (2.6). 

Then,  there  is  a  unique  set  x,  ,  r  =  1,  2 . T;  /  =  1, 2 . N  feasible  in  (A)  such  that  y,  and  x,, 

are  related  as  in  (2.2)  and  x,  ,  satisfies  (2.5). 

Whenever  x,  ,  and  y,  (=1,2 . /V,  r  =  1, 2 . T  are  as  in  Proposition  2.2  we  will  say  that 

they  are  corresponding  feasible  extreme  point  solutions. 

Unfortunately,  ify,  r  =  1,2 . T  is  an  optimal  extreme  point  in  iAP)  the  corresponding  feasi¬ 

ble  extreme  point  solution  in  (P)  need  not  be  optimal,  in  general,  in  that  problem.  In  fact,  it  is  not 
even  true,  in  general,  that  the  optimal  value  of  one  of  the  problems  is  a  lower  or  upper  bound  of  the 
other.  Two  instances  where  a  relation  between  zp  and  z^p  is  known  a  priori  are  given  in  the  next  two 
results. 

PROPOSITION  2.3:  If  k,,p,  and  /i,  are  chosen  such  that  p,  ,  —  Pfk,  ^  0,  h. ,  -  h,  k,  ^  0 
(P/.j  “  Pik,  ^  0,  h,j  -  h,k,  ^  0)  for  ail  i  and  all  r,  it  follows  that 

^  ('4/»  ^ 

PROOF:  Follows  by  Proposition  2.2  and  Expression  (2.4).  □ 

COROLLARY  2.1:  Assume  the  unit  and  holding  costs  p,,.  It,,  (  =  1,2 . N  and 

f  =  1,2 .  T  remain  in  the  same  proportion  for  all  /,  i.e.,  p,,  =  k,p,,  h,,  —  k,h,  for  some  k,  >  0, 

p,  >  0  and  h,  >  0.  Then,  zp  =  z^p  as  long  as  p,  and  h,  are  used  in  iAP)  as  the  underlying  unit  and 
holding  costs. 
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PROOF:  Follows  by  Proposition  2.3.  □  . 


A  special  case  of  Corollary  2.1,  of  interest  in  practical  settings,  is  when  the  unit  and  holding  costs 
are  constant  for  each  item  over  the  planning  horizon  and  the  holding  cost  is  proportional  to  the  unit 
cost.  In  these  situations  ft  ,  =  ft./t,.,  =  /i,  and  >i,  =  rp,  for  all  items  and  therefore  zp  =  Zy,p. 

Even  when  the  conditions  of  corollary  2.1  do  not  hold,  we  may  prefer  to  solve  (AP)  instead  of 
(P)  either  to  save  computational  work  or  because  family  demand  data  is  readily  available  or  even 
because  family  demand  can  be  forecast  with  greater  accuracy  than  item  demand,  /c,  and  h,  can  always 
be  chosen  to  satisfy  the  conditions  in  Proposition  2.3.  However,  the  gap  between  zp  and  z^p  can  be 
greater  than  desired.  In  what  follows  we  attempt  to  measure  this  aggregation  gap  under  quite  general 
conditions. 


In  the  remainder  of  this  section  we  assume  the  following; 

T  N  D, 

/c,  =  1,  A  =  I  4.,.  0=1  O,.  c,  = 


,V  N  M 

o,  =  X  d,r  d,j  =  c,D,  +  y),.,.  ft  =  £  ftp,,,  and  4=  £  ft^.j 
/- 1  1  t- 1 

where,  for  all  expressions 

I  =  1.2 . /V;  /  =  1.2 . T. 


Note  that 

f2.7)  X  =  i  ”  0 

for  all  i  and  t. 


Let  X  be  feasible  in  (P)  and  let 


0i(x)  = 


f :  I  <  f  <  r.  X  x,.,  >  0 


For  r  €  we  denote  by  nix.  r)  the  smallest  integer  in^(x)  U  (F  +  1|  larger  than  r.  i.e., 
nix.  r)  =  minlr  :r  >  r.  t  3tix)  U  (r  +  l)). 


J?(x)  and  nix.  r)  correspond,  respectively,  to  the  set  of  time  periods  when  any  production  occurs 
and  the  next  such  period  after  /. 

Assume  x  is  an  optimal  extreme  point  of  iP).  Then, 


T 

N 

1  T-l 

Zp  -  X 

Z 

+  I  I 

Pu  Z 

d,r+  Z  4,  Z  4« 

/-I 

i-1 

,€  »(.t)  i-I 

'  r-/ 

r-fFl  w-/ 

Let  >'  be  the  corresponding  feasible  extreme  point  solution  to  x  in  iAP)  then. 


-p  = 'Z  s,f)i.v,)  +  X  L 

/-I  ,<  <n.\>  I- 1 

=  X4fi<ft)+  X  i 

1  (x  >  I 


n(x.r^-i  r- 1 

X  A.,4r  +  L  4r  L 

r— f  r»f+l  M-f 

X  A.,<ftO,  + 'P,.,)  +  X  *ftO, +  v;,  ,)  X  4i/ 

r-,+l  u-l 
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=  £5,M.P,)+  £ 

'-I 


+  I  I 


1 

-  I 

P,D,+  I  D, 

T-/+  1 

n  <  .V.  /  >  -  1 


Pu  +  X 


>  z^p  +  <X>  (x ) 


where 


(2.8) 

cb(x)=  £  £ 

Pu'P, 

T-  1 

p,.,  +  L  h,.u 

li  ,<fG)  1-1 

T-/+1 

u^t 

We  have  just  proved 

PROPOSITION  2.4:  If  x  is  an  optimal  extreme  point  of  (P), 
Zp  ^  z^p  +  (t>(x). 


Similarly,  define  for  any  extreme  pointy  in  (AP) 
my)  =(/:!</<  T  y,  >  01 
and 

n(y,  r)  =  minir  :r  >  r,  r  €  U  {7"  +  111. 

Let  y  be  an  optimal  extreme  point  of  (AP)  and  x  be  its  corresponding  feasible  extreme  point  solution 
in  (P).  Then, 


T 

f  T-  1  ] 

T 

s  \ 

=  X  •^'**^'*  ■*■  Z 

P,D,  +  £  DAp,  +  £  fiJ 

= 

I  -Vuj 

'-1  liXiy) 

r 

r-/+!  (  w-f  J 

7-1 

s 

n(,v./)- 1 

r-  I 

+  I  z 

Pi.S^^u  -  u')  +  X 

Pu  +  X 

*~d,.r  -  'P,r') 

i-I 

r-/+l 

1/-/ 

(2.9) 

r(y)=  £ 

,v 

Z 

Pu^,. 

niy.l}-  1 

( +  Z 

T-  I 

Pt.i  ^  ^i.u 

/€  .^(v) 

/=l 

T=f+  1 

M=7 

Consequently, 

PROPOSITION  2.5:  If  y  is  an  optimal  extreme  point  of  (AP), 

Zp  ^  z^p  +  rij). 

COROLLARY  2.2:  If  y  and  x  are  optimal  extreme  point  solutions  of  (AP)  and  (P),  respectively, 

then 

<I>(.x)  ^  z/.  -  <  r(y). 

Note  that  xandy  need  not  be  corresponding  feasible  solutions  in  Corollary  2.2. 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


VOL.  29,  NO  I,  MARCH  1982 


A 


DIAGNOSTIC  ANALYSIS  OF  INVENTORY  SYSTEMS  35 

Corollary  2.1  shows  that  if  the  unit  and  holding  costs,  for  each  item  in  the  family,  remain  in  the 

,v 

same  proportion,  then  zp  =  z^p.  A  similar  result  holds  for  the  demands  even  when  h,  ^  c,h,  , 

and/or  p,  ^  Y. 

1-1 


COROLLARY  2.3:  Assume  that  =  0  /  =  1,2 . A,  t  =  1,2 . 7",  that  is,  </,, 

Then 


,A. 


-  ^AP- 


PROOF:  From  (2.8)  and  (2.9)  we  have  <l>(x)  =  r^y)  =  0  and  by  Corollary  2.1, 

0  ^  Zp  -  z^P  ^  0.  Hence,  zp  =  z^p.  □ 


Corollary  2.3  provides  an  interesting  interpretation  of  problem  (AP).  More  specifically,  it  shows 
that  solving  the  aggregate  problem  is  equivalent  to  solving  the  detailed  problem  with  the  demands  d,, 

A 

replaced  by  c,D,  =  —  D,.  In  fact,  under  the  conditions  of  the  corollary,  the  following  stronger  result 
holds: 


PROPOSITION  2.6:  Under  the  conditions  of  Corollary  2.3,  f(x)  =  'fiy)  for  every  pair  x.  y  of 
corresponding  extreme  points. 


PROOF:  Note  that^Ix)  =  ^(y)  and  nix.  t)  =  n(y,  l)  for  all  t  €  ^(x)  and^iy). 


fix)  =  Y  ^'8 

/-I 


s 

N 

n  (x,f)-  1 

T-  1 

£  Xu 

+  I  I 

PiA.t  +  £ 

£  ku  +  Pu 

d,r 

/-I 

V-/+1 

w-/ 

T  N 

n  (^.r)- 1 

\  ■ 

T-  1 

/ 

£S,6(j-,)-e£  Y 

P/./f/A  +  £ 

£  hi.u  +  Pu 

CiD, 

(-1  /-I  tiJUy) 

r-f+1 

W-f 

£S,8(j;,)+  T 

f-1  /€iTv) 


niy.f)-  I 
P,D,  +  Y 

r-f+1 


u^l 


D, 


=  'fiiy). 


We  should  note  here  that  the  conditions  of  Corollary  2.3  (proportionality  of  the  demands)  are 
equivalent  to  requiring  the  multiplicative  seasonal  factors  of  the  dilferent  items  to  be  equal  period  by 
period.  In  what  follows  we  prove  this  result. 


PROPOSITION  2.7:  Assume  that  we  fit  the  model  </,,,  =  Si,,(bo.i  +  to  every  item  i.  where 
boj  and  by,  are  determined  by  the  least  squares  method,  and  the  s,,  are  the  multiplicative  seasonal  fac¬ 
tors  defined  according  to  Winters  (111.  Then,  if  the  demands  are  proportional  as  required  in  Corollary 
2.3,  it  follows  that  5,  ,  =  j,,  for  all  /  =  1,2,  . . .  ,  A,  /  =  1,2,  . . .  ,  L  and  r  =  1,  2,  . . .  ,  T. 


PROOF:  For  each  of  the  A  items  the  offset  value  bo.i  and  the  linear  trend  by,  have  the  property 
of  minimizing  the  sum  of  squares  of  the  seasonal  variation: 


b,,= 


T 

Di 

I 

/-I 

du  -  -J 

(t  -  t) 

10 -fH 

r-l 


^  AY  iTd,,,-  Di)it-  t) 

t-) 


A 


*0./ - Y  ~  8i.> ' 
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where 


£(t~7)^ 

.  1  +  T 

A  =  - - - and  i - - — 

the  seasonal  factors  are 

t>0.,  +  t*!,,  ' 


Assuming  proportional  demands:  d,,  =  k^jd,,  /  =  1, 2 . N  t  =  \,1 . T”  for  a  generic  item  / 

and  a  fixed "  base”  item  /,  we  get 

du  =  +  tb\,)s,,  =  Si,(k,ibo,i  +  tk,|b^| 

but 


and 


r  _  T 

b\.,  =  L  -  D,)U  ~  t)^  A  Y,  ^Tk,jdu  -  k,jD,)U  -  t) 

j-i  /-I 


=  KjA  Y  ~  (t  -  i)  =  kub,  , 


i-\ 


Dj  —  kj  j  D[  — 

bo.i  —  ~Y  ~  b]  jt  =  Y  k,ib\  ,t  —  kjjbij  i. 


Therefore, 

du  ~  ^i.Ab^  i  "E  fb\  ,) 

which  implies: 

Si,,  =  Sii  i  =  \,2.  . . .  ,  N  I  =  \,2 . T.  □ 

When  the  conditions  of  Corollary  2.3  do  not  hold  it  is  possible  to  derive  bounds  for  the  gap 
between  Zp  and  z^p  as  we  show  in  the  following  development. 

Let  a,  be  defined  as  the  maximum  absolute  value  of  the  relative  difference  between  each  period 
demand  of  product  i  and  its  annual  demand,  and  the  corresponding  ratio  of  the  family,  i.e.. 


(2.10) 


Hence, 

(2.11) 


^  _  A 

D,  D 


a,  «  max 

/-1.2 . T 

d,., 

—  /  =  1,2 

. N. 

o 

A 

D, 

a,  =  max 

f-1.2 . T 

J  oA 

=  max 

1  (-1.2 . T 

^  di_,  - 

1  di,>0 

(-1.2 . r  d,, 

ra,,>n 
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a  =  max  a,  =  max  — '~:t  =  1,2, 

/-1.2 . N  4./ 


..  T.  r  :d„  >  0,  /  =  1,2, 


^  adi,  /  =  1, 2,  . . .  ,  jV,  /  =  1,  2,  . . .  ,  7"  and  d^  >  0. 


Note  that  when  d,  ,  =  0, ,  =  ~c,D,  <  0. 


For  any  extreme  point  x-  in  (F), 


»U/)-l  r-l 


|4>WI=  II  I  I  14..^,,, 


/€  jtix)  i-l  T-/ 


Af  n(x./)-l  t-1 

S  Z  Z  A.,y’,r+  I 

1-1  r=/  T-7+1  w  =  / 

T;rf,  ,>0  rrf  >0 

AT  /.  T  J 

A/  /i(x./)-I  n<x./)-l  r-l 

<  I  I  I  I  I/2,,J^,,J 

J^Kx)  1-1  T-r  T-/+1  u-r 

Tirfj  ^>0  r;</^  ^>0 

A'  rt(x./)- 1  nix.O-  1  r-  1 

^  22  I  I  I  A./4.r  ■*■  I  Z  "  fl[F/>(x)  +  Hp(x)] 

r— r  r— 


where  F’/.(x)  and  ///.(x)  are  the  proportional  purchasing  and  holding  costs  in  problem  (P)  associated 
with  the  feasible  point  x. 


Similarly,  for  any  extreme  pointy  in  (AP) 


S  n<r./l-  I 


f-1  I  r-l 


«(>,/)-  1  r-  I 


lr(y)l<22  III  A,Xr+  I  I/>,..4,r. 


However, 


<  \d,,r  “  =  IsP,  J  <  ad,  ,. 


Thus,  if  fl  <  1, 


d,r  < 

I  —  a 


Substituting  (2.14)  in  (2.13) 


Ar  /j(v,f)-I  r- 1 


lr0')l<Tf7  Z  Z  I.PuC.D,+  J:  £  I4..CA 

r— /  /«j  I  r— r+1  u-f 


lP,p{y)  +  //^p(y)] 


where  PAp(y)  and  H^piy)  are  the  proportional  costs  in  (AP)  corresponding  to  y. 
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Before  deriving  the  desired  bounds  we  observe  that  if  x  and  y  denote,  respectively,  optimal 
extreme  points  of  (P)  and  (-4P)  then  zp  and  z^p  can  be  written  as 


and 


Zp  =  Sp(x)  +  Pp(x)  +  Hp(x) 


^AP  =  ^AP^'f  +  Pap^"*  +  H^p(y) 


where  Sp(x)  and  S^ipiy)  are  the  fixed  costs  associated  with  x  and  y.  Also,  to  use  as  reference  value  in 
the  measurement  of  the  gap,  we  define 


V  =  y 

*■  max  ^ 
f=l 


■5,  +  X  Pu^U 


1-1 


+  max{/’/.(x)  +  Hpix):  x  feasible  in  (F)l  =  5^  +  (P^  + 


and 


’max  =  X  Pi^i^  max(P^p(y)  +  H^p{y):y  feasible  in  (/t/*))  =  S-^  +  (/’*'  + 


/-I 


Therefore,  by  Proposition  2.5  and  Expression  (2.15),  if  a  <  I 

^P  ~  ^AP  ^  _ |r  (y)| _ 

4ax  -  +  (/’^+  //•^)„ax  -  PAp(y)  -  H^p(y) 

^  _a _ H^p(y)  +  P^piy) _ 

I- o  s>--  s,p(y)  +  (p>+  //>)„ax  -  PAp(y'>  -  HAp(y)  ■ 


Since  (P^  +•  //>)„ax  >  PAP^y)  +  H^piy)  >  0  and 
S"  >  z^p  =  S,4/.(J)  +  P^piy)  +  H^p(y) 

it  follows  that 


(2.16) 


^p  ~  ^AP  ^  a 

r'’  -  r  1  - 

*max  ^AP  * 


Expression  (2.16)  provides  an  upper  bound  on  the  gap  between  the  optimal  values  Zp  and  z^p 
relative  to  provided  that  a  <  I .  Another  gap  measurement  often  employed  in  this  type  of  analysis 
is  obtained  considering  only  the  aggregate  problem  as  a  basis  for  comparison,  i.e.. 


(2.17) 


^p  -  ^AP  ^  |r(y)|  ^ 

^AP  ^AP 


\P4p(y)  +  H,p(y)] 
i  —  a _ _ 

S^piy)  +  Pap^v^  +  M4p{y) 


Other  upper  bounds  can  be  obtained  as  follows: 

j  ^AP  ~  ^p  ^  _ — <t>(x) _  ^  a [P p(x)  +  Hp(x)] 

^max  -  ^Ap'^  +  (P"  +  -  5p(x)  -  Ppix)  -  Hp(x)  "  S"  -  Spix) 


The  inequalities  in  (2.18)  are  implied  by  Proposition  2.4,  Expression  (2.12)  and  the  facts  that 
iP'  +  >  Ppix)  +  Hpix)  ^  0  and  S'  -  Spix)  >  Ppix)  +  Hpix). 


Similarly, 


(2.19) 


^AP  ~  ^P 
2P 


-tKx) 


S/>(x)  +  Ppix)  +  Hpix) 


^  a. 
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Lower  bounds  on  the  value  of  Zp  are  readily  obtainable  from  inequalities  (2.18)  and  (2.19). 
These  expressions  imply  (2.20)  and  (2.21)  below: 

(2.20)  Zp  ^  ; -  if  O  <  I 

1  —  a 

and 


(2.21) 


\  +  a' 


Note  that  if  the  production  costs  p,,  do  not  change  over  lime,  they  can  be  dropped  from  the 
objective  function  in  the  formulation  of  problems  (P)  and  (AP).  In  addition,  the  reference  costs 
and  can  be  redefined  as 

7 

“max  =  X  niax{A/,,(jf)  :x  feasible  in  (P)) 

(-1 

7 

“max  X  \H^p(y) : y  feasible  in  iAP)\ 

i-\ 

in  order  to  get  tighter  bounds. 

We  summarize  the  results  obtained  in  the  next  proposition. 

PROPOSITION  2.8:  The  relative  gap  between  zp  and  z^p  can  be  bounded  as  follows: 


(2.16) 

(1) 

ZTz  ~  ^At>  ^ 

.  ^  1 

a 

-  a  <  \ 

* 

a 

(2.17) 

(2) 

1 

n 

*35 

/A 

_ _ 

if  a  <  1 

Zap  1  - 

a 

Z4P  —  Zp 

(2.18) 

(3) 

X  <  a 

*max 

(2.19) 

(4) 

1 

N 

/A 

^p 

{  ^  \ 

^p  *“  ^AP  ^  a 

^AP  ^max  -c- 

\  L. ZZ) 

V  J  ) 

^  . 

it  0 

r.,/.  I  - 

a 

-AP 

(2.23) 

(6) 

^P  ~  -AP  ^ 

^  . 

a 

_ 

-AP  ^ 

+ 

a 

Expressions  (2.22)  and  (2.23)  are  equivalent  to  (2.20)  and  (2.21),  respectively. 

Proposition  2.8  indicates  that  (AP)  is  a  good  approximation  to  (P)  whenever  the  value  of  a  is 
"small "  When  all  data  on  the  demands  of  the  items  are  known,  the  value  of  a  can  be  computed 
exactly.  In  this  case  if  the  bounds  are  satisfactory  we  may  want  to  solve  (AP)  to  reduce  the  computa¬ 
tional  requirements.  However,  most  likely,  in  practical  settings  the  c/,,'s  are  not  known  exactly  and  a 
can  not  be  calculated.  Proposition  2.8  can  still  be  very  useful  in  these  occasions  because  the  value  of  a 
depends  only  on  the  maximum  relative  absolute  deviation  of  the  demands  of  the  items  and  of  the  fam¬ 
ily,  and  managers  are  often  able  to  estimate  bounds  for  such  a  value.  An  important  by-product  of  Pro¬ 
position  2.8  is  the  qualitative  insight  it  provides  for  aggregating  items  in  diagnostic  studies  of  the  nature 
discussed  in  this  paper. 
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Finally,  we  point  out  that  (2.18)  and  (2.22)  hold  if  replaced  by  an  upper  bound.  The  same 

is  true  with  (2.16)  with  respect  to  Also,  if  it  is  assumed  that  the  demand  of  item  /  has  no  trend, 

a,  can  be  seen  as  the  maximum  absolute  value  of  the  relative  difference  between  the  seasonal  factors  of 
the  item  and  the  seasonal  factors  of  the  family. 

3.  SINGLE  PERIOD  STOCHASTIC  DEMAND 

In  this  section  we  address  the  following  problem:  Given  a  family  of  N  items  with  single  period 
stochastic  demands,  determine  upper  and  lower  bounds  on  the  maximum  deviation  between  replenish¬ 
ing  the  items  individually  and  as  an  aggregate  family. 

The  following  conditions  below  are  assumed  to  hold  throughout  this  section: 

(i)  The  interval  of  time  with  nonzero  demand  is  the  same  for  all  items. 

(ii)  The  demand  of  each  item  is  independent  of  the  demand  of  the  other  items  and  is  nor¬ 
mally  distributed  with  mean  fx,  and  standard  deviation  o-,. 

(iii)  The  unit  under  stock  and  over  stock  costs  are  proportional  to  the  value  v,  of  the  item  and 
are  written  as 

over-stock  unit  cost  =  co,  =  Civ^  /  =  1.2 . N. 

under-stock  unit  cost  =  ci/,  =  C2V,  i  =  1.2 . N. 

In  order  to  determine  the  desired  bounds  it  is  necessary  to  establish  the  following  results. 

PROPOSITION  3.1:  Consider  the  problem 

A 

2  =  max  (T, 

i-i 

subject  to 


I- 1 


rr,  ^  0  (  =1.2 . N. 

Then 

2  =  V  N(T  . 

PROPOSITION  3.2: 

Consider  the  problem 

(W)  2  =  min  ^  (T, 

(-1 

subject  to 

i-l 

(T  ^  cr  I  ^  cf  ('  =  1 

where  we  assume  that  <z  <  (?.  When  2: 
nonnegative  numbers. 


.2 . ,V 

=  (7  (he  resolution  of  (  W  )  is  trivial,  cr^.  a  and  <f  are  known 
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Then, 

Z  =  +  A/2CT  +  y  (~Jcr^  —  A/|CT^  — 

where 


and 

N2  =  N  -  N^  -  y. 

[xl  and  IxJ  denote,  respectively,  the  largest  integer  less  or  equal  than  x  and  the  smallest  integer  greater 
or  equal  than  x. 

PROOF:  Problem  ( IT')  can  be  written  as 

z  =  min  X 
(-1 

subject  to 

I  T,  = 

y,  +  s,  = 

y,  ~  ‘i 

T,.  S,.  r,  ^  0 

The  objective  function  is  strictly  concave  and  the  feasible  set  is  compact.  Hence,  there  is  a 
unique  optimal  solution  which  is  an  extreme  point  of  F,  the  feasible  set  of  ( IT  ). 

Every  extreme  point  of  F  has  2/V  +  1  basic  variables.  Assume  that  >,i  and  >’,2  are  in  the  open 
interval  Then,  the  corresponding  slacks  s,|.  s,2,  and  1,2  strictly  positive.  We  are  left 

with  2N  +  I  -  6  =  2(.V  -  2)  —  1  variables  in  the  basis.  The  remaining  2{N  —  2)  bounding  constraints 
must  have  each  at  least  one  variable  in  the  basis.  However,  this  is  impossible  since  we  are  left  with 
2(N  -  2)  -  1  basic  variables.  Consequently,  at  most,  one  variable  y,  can  be  in  the  open  interval 
for  each  extreme  point  of  F.  Thus,  at  every  extreme  point  of  F,  y,  =  or  y,  —  for  all 
except  at  most  one  /.  Therefore,  the  minimum  cost  solution  will  have  as  many  variables  equal  to  as 

possible  while  still  satisfying  the  knapsack  constraint  ^  y,  =  cr^ 

I- 1 

The  optimal  value  of  (N)  is  of  the  form 
N^  components  equal  to 
Ni  components  equal  to  a ^  and 
y  components  in  the  open  interval  (o^^, 

where  /Vj.  N2  are  nonnegative  integers,  y  is  either  zero  or  one,  and  N  —  +  N2  +  y- 

From  the  knapsack  constraint  it  follows  that 
+  /V2<t^  +  -yr  — 


/•  =  1,2 . N 

;■  =  1,2 . A 

(•  =  1,2 . N. 
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where  T  €  Thus, 

+  i^2  ■y)5'^  ^  (T^ 

N^v^  +  (N  -  ^  ,T^ 


and 


(3.1) 


/V|  ^ 


z  can  be  written  as 


z  =  A/’iff  +  A(20’  +  yr. 


Consequently,  A/j  should  be  as  large  as  possible  in  the  optimal  solution.  From  (3.1), 


Note  that  if  — 

y  = 


— -  is  integer  then  y  =  0.  Otherwise  y  =  1.  Therefore, 

(T^ 


Let  k  =  - and  let  ^  be  a  normally  distributed  random  variable  with  mean  zero  and  standard 

C|  +  C2 

deviation  equal  to  one.  Denote  by  C*  the  value  of  {  such  that 
Prob(f  <{*.)  =  A-. 


It  is  well  known  [6]  that  the  optimal  lot  size  for  item  /  is 

(3.2)  Qt=^f^,+(k<rr 


The  total  under-stock  and  over-stock  cost  for  item  /  is 

(3.3)  TC,  =  cOi(Q*  —  fi,)  +  (co,  +  cuj)  (TiGiiif) 

where 

jr 

and  /(•)  is  the  normal  probability  density  function  16). 

Substituting  (3.2)  in  (3.3) 

TC,  =  co,^i,(T,  -I-  (co,  -I-  cu,)<r ,G (Ck^  =  (cif*  +  (c)  C2)G(ft)]ViO-,. 

The  total  cost  of  under-stock  and  over-stock  for  all  items  if  each  is  ordered  separately  is 
TC  =  £  TC,  =  (c,«*  +  (c,  +  C2)  C(C*)1  £  v,<r,. 

i-i  i-i 

Let 

^  =  CiC*  -E  (c|  +  C2)  C(C*). 
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Hence, 

(3.4)  TC  =  A/£  v.o-,. 

1-1 


Assume  the  N  items  are  aggregated  in  a  single  product  with  parameters  co  =  C|V,  cu  = 
v=  ^  V,,  and  o-^  =  ^  O'/.  The  demand  of  the  aggregate  product,  or,  equivalently,  the 

J— I  (—1  /“I 

family,  is  single  period  and  is  normally  distributed  with  mean  m  and  standard  deviation  o-. 

The  optimal  ordering  quantity  and  cost  for  the  family  are 

03  =  M  + 

and 

TC'^  =  M  vcr. 


Note  that  TC^  would  be  equal  to  TC  is  o-  was  equal  to  ^  o-, 


•  1 


We  propose  to  use  as  an  approximation  for  TC  the  value  TQ  defined  as 

I 


„  _1_ 
yfN 


TC^  =  TC’^  ~  ^  M  \<tN  2 . 


The  quality  of  the  approximation  can  be  measured  by  the  following  relative  error 


(3.5) 


M\<tN  ^  —  M  T  v.o-, 

1  ^  i/rr 


TC, 


M\crN 


1-1 


Given  a  finite  sequence  (xil,  let  x  and  x  denote,  respectively,  max,{x,),  and  min,(x,l,.  Define 


_  yVv  Ay  ^  VNa  „  VNo: 

=  — .  02  -  -7-.  - 02  =  -7— 


Note  that  a\  is  the  quotient  of  the  maximum  and  average  unit  value  of  the  /V  items.  Similar 
interpretations  hold  for  the  other  parameters.  Moreover, 

(3.6)  a,  ^  1,  flj  ^  1,  /3,  >  \.  I- 

From  (3.5), 


_ _  iV  VO- 

A  =  I  -  Vn  y  >  1 

^  V(r 


VNv  ^  , 


1-1 


nViv  J"' 


and  by  Proposition  3.1, 


(3.7) 


A  >  1 - ^  V77o-  -  1  -  fl,. 

(T-V /V 
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From  (3.5) 

(3.8)  A  =  1  -  VTV  T  >  1  -  =1-/3, 

v(r  o- 


(3.9) 


Upper  bounds  on  the  value  of  A  can  also  be  derived  from  (3.5): 

va  <7 


Similarly, 


.it  w  »  c  '  jjio- 


By  Proposition  3.2, 
(3.10)  A  <  1  -  ^72 


yfN  (T 


The  next  proposition  summarizes  the  results  obtained. 

PROPOSITION  3.3:  The  following  bounds  hold  for 

TQ  -  TC 

A  =  — = - 

TC, 

(1)  1  -  a,  A  <  1  -  02^^ 

(2)  1  -/3,  <  A  <  1  -  /32. 

The  bounds  on  A  depend  on  the  extremal  parameters  of  the  items  and  the  family  parameters. 
These  can  be  estimated  with  greater  accuracy  and  less  cost  than  the  items  parameters.  Moreover,  (1) 
and  (2)  in  Proposition  3.3  hold  with  a\,  ai,  /3|.  (82  replaced  by  respective  bounds.  Therefore,  the  com¬ 
putation  of  TC,  may  play  a  very  useful  role  in  the  diagnostic  analysis  of  a  family  of  items  with  single 
period  stochastic  demands. 

4.  CONCLUSIONS  AND  TOPICS  FOR  FUTURE  RESEARCH 

In  this  paper  we  have  developed  diagnostic  models  for  a  limited  class  of  inventory-control  prob¬ 
lems.  We  have  shown  the  effect  of  different  aggregation  schemes  and  their  power  to  estimate  the  per¬ 
formance  to  be  expected  from  the  implementation  of  a  recommended  system.  Upper  and  lower  bounds 
were  provided  for  that  purpose. 

A  very  important  part  of  a  diagnostic  analysis  is  the  evaluation  of  the  performance  of  different 
inventory-control  algorithms.  We  limited  our  analysis  to  consider  two  basic  situations:  a  case  where  the 
items  of  the  system  share  a  common  setup,  and  a  case  where  the  products  have  single-season- 
independent  demands.  In  those  situations,  the  following  diagnostic-analysis  scheme  is  suggested: 

—Group  the  items  into  families  according  to  some  specific  rules  depending  on  the  particularities 
of  the  products.  Proposition  2.6  proved  that  similar  seasonal  behavior  gives  optimal  aggregate 
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schedules  for  items  sharing  a  set  up.  Proposition  3.3  proved  that,  for  the  single  period  case,  either 
same  item  values  or  same  standard  deviation  of  the  item  demands  also  yield  optimal  aggregate  models. 

—Compute  aggregate  information  on  the  critical  parameters  used  to  describe  inventory  perfor¬ 
mance. 

—Solve  the  aggregate  models,  and  compute  lower  and  upper  bounds  for  the  optimal  value  of  the 
total  inventory  cost. 

—The  bounds  provide  a  range  within  which  the  inventory  cost  will  reside  if  the  decision  rules 
being  considered  would  be  implemented  to  the  actual  inventory  system.  Thus,  we  have  obtained  a  first 
approximation  to  assess  the  benefits  to  be  derived  from  the  application  of  those  decision  rules. 

We  believe  that  diagnostic  analyses  are  important  practical  tools.  The  have  been  neglected  in  the 
literature  and  a  great  deal  of  work  remains  to  be  done  to  determine  effective  diagnostic  methodologies 
not  only  for  inventory  systems  but  also  for  logistics  problems  in  general.  Within  inventory-control  sys¬ 
tems  the  next  natural  step  is  to  develop  diagnostic  models  to  study  items  that  do  not  share  the  same 
setup,  items  subject  to  quantity  discounts,  items  subject  to  inventory  constraints  and  other  cases  cited 
in  the  taxonomies  provided  by  Silver  [7],  Nahmias  (51,  and  Aggarwal  [1].  Production  planning  and  dis¬ 
tribution  systems  are  other  areas  for  potential  development  of  diagnostic  models.  The  need  for  such 
methods  should  also  motivate  researchers  to  explore  the  concepts  of  aggregation  and  heuristics  not  only 
under  the  view  point  of  computational  complexity  but  also  to  prevent  the  massive  data  collection  and 
processing  that  would  make  diagnostic  studies  prohibitive. 
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ABSTRACT 

Production  planning  for  large-scale  production  systems  requiring  the  alloca¬ 
tion  of  numerous  resources  is  considered  It  is  demonstrated  hoi*  the  dy  namic 
activity  analysis  developed  by  Shephard  leads  to  linear  programming  solutions 
of  production  planning  problems.  Three  types  of  planning  problems  arc  formu¬ 
lated;  maximization  of  output  levels  for  a  given  time  hori/on.  mimmi/aiion  fit 
production  duration  for  given  output  histories,  and  minimi/aiion  of  producimn 
costs  for  given  output  histories 


1.  INTRODUCTION 

Previous  efforts  (von  Lanzenauer  (4i  and  Candea  (Ij)  to  mathematitully  model  eapaciiaied,  mul¬ 
tistage  production  systems  have  been  motivated  by  manufacturing  shop  environments,  in  which  man> 
products  are  to  be  produced  using  a  given  network  of  facilities. The  problem  considered  is  to  deter¬ 
mine  workforce  levels  and  product  lot  sizes  in  each  time  period  so  as  to  minimize  costs  to  meet  exter¬ 
nal  demand  schedules  [1]  [4]. 

The  focus  of  this  paper  is  planning  for  production  systems  in  which  the  production  network  ele¬ 
ments  are  dedicated  to  producing  a  single  product,  but  allocation  of  numerous  resources  among  the  ele¬ 
ments  is  required,  and  other  kinds  of  production  planning  problems  are  posed.  Shephard,  .M-.Ayat  and 
leachman  [7],  and  Shephard  and  Al-Ayat  [8]  have  developed  a  continuous  flow  dynamic  activity 
analysis  model  of  production,  in  which  a  network  of  activities  characterizes  the  component  tasks  of  pro¬ 
duction.  Required  facilities  and  other  resources  are  considered  as  inputs  to  be  allocated  among  the 
activities.  The  presentation  in  [8]  is  taken  as  an  appropriate  point  of  departure  here.  In  the  following, 
this  model  is  extended  to  include  inventory  capacities  as  in  [4],  initial  inventories  of  intermediate  pro¬ 
ducts,  and  classes  of  exogenous  inputs. 

Three  types  of  production  planning  problems  are  formulated  and  solved  using  linear  programming 
methods.  The  problem  types  considered  are  maximization  of  output  accumulations  by  a  given  horizon; 
minimization  of  production  duration  for  required  output  histories;  and  minimization  of  costs  for  given 
output  requirements. 

2.  THE  MODEL 

Following  Shephard  and  Al-Ayat  (8|.  the  production  system  is  viewed  as  a  network  of  production 
activities  which  are  denoted  by  A,,  A 2 . -4^.  In  the  network,  nodes  represent  activities  and  arcs 


'This  research  vsas  supported  hv  the  Office  of  Naval  Research  under  C  ontract  N00014-76-C-01.t4  with  the  C;niversitv  ol  (  alifor- 
nia 

^No  attempt  is  made  here  to  reference  multistage  modeling  efforts  of  uncapacitated  systems  or  pure  serial  or  parallel  structures, 
for  a  survey  of  such  efforts,  see  (  andea  [ll 
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indicate  intermediate  product  transfers,  i.e.,  the  use  of  each  activity’s  output  as  input  by  other  activities. 

The  operation  of  each  activity  A,  is  measured  in  terms  of  an  intensity  function  /  =  0,  ! ,  2 . 

whose  value  at  time  i  indicates  activity  input  during  [/,  /  +  1)  and  output  at  time  (t  +  1)  when  taken 
with  technical  coefficients  defined  as  follows: 

(a)  c,(i).  1  =  0,  1,  2 . /=1 . N,  where  c,(t)  is  the  amount  of  output  of  activity 

at  time  t  per  unit  intensity  of  activity  A,. 

(b)  (r),  1  =  0,  1 ,  2,  , . .  ,  k=  I,  2,  ....  NK,  /  =  1 .  . . .  ,  N,  where  a,*; (r)  is  the  amount  of 
exogenous  input  type  k  required  at  time  /  per  unit  intensity  of  activity  A,.  The  first 
SS  <  NK  inputs  are  designated  nonstorable  resources  which  cannot  be  accumulated;  the 
remaining  exogenous  inputs  can  be  accumulated,  and  are  termed  storable  resources, 

(c)  a, lit),  t  =  0,  1.  2 . /=1 . N,  7=1 . A',  where  o„(r)  is  the  amount  of  inter¬ 

mediate  product  from  activity  A,  required  at  time  t  per  unit  intensity  of  operating  .4,. 

For  a  time  horizon  T  for  production  activity,  we  introduce  the  following  technical  limitations  on 
the  system: 

(a)  v  the  activity  intensity  bounds,  natural  bounds  resulting  from  available 
workspace  and  other  limitations  not  considered  as  exogenous  input; 

(b)  (AT* v,v  the  lime  histories  of  nonstorable  resource  levels  available  for  input 
to  the  system; 

(c)  1  F^(f)l/“®vivi,  :T,''va'"  the  time  histories  of  storable  resources  made  available  to  the  sys¬ 
tem,  where 

i  n(T) 

r-O 

is  the  cumulative  amount  of  resource  k  supplied  during  (0,  t); 

(d)  {inv°),_,  ,y,  the  initial  inventories  of  activity  product  for  intermediate  uses;  and 

(e)  {cap,(r)l/r®;  '  .  iy  the  bounds  on  accumulations  of  activity  product  awaiting  intermedi¬ 
ate  uses,  arising  from  limited  storage  capacity  for  in-process  inventories. 


For  our  purposes  here,  the  intensity  r,(/)  of  activity  A,,  /=] . .V,  on  each  lime  interval 

[r,  /-E  1),  r  =  0,  1,  2 . T—  1,  shall  be  partitioned  into  effort  producing  intermediate  product,  ;/(r), 

and  effort  producing  final  product,  where 

(1)  c/(r)  +  z/(/)  =  z,(/). 

These  variables  indicate  the  allocation  of  activity  output  produced  during  [r,  /  — 1)  to  final  and  inter¬ 
mediate  uses. 

A  production  plan  is  a  specification  over  some  finite  period  [0.  T)  of  the  activity  intensities 
!z/(/),  z/(r));_-" 

Such  a  plan  is  said  to  be  Jcasihic  for  LiT)  if  the  plan  belongs  to  the  set  L(T)  defined  by  the  following 
inequalities: 
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ajt)  IzfU)  +  z/(/)l  A-  =  1 . A^S,  1  =  0,  1 .  T-\. 

,=  \ 

f  ,v  , 

5^  fl,^(T)  [r/(T)  -F  r/(T)|  <  £  r*(7).  A  =  A-S  -f  1 . NK. 

/  =  0 . T  -  \. 

£  ^  a„(T)  [z/(t)  -F  r/(r)l  -  £  c,(t  -F  l)r/(r)  <  inv«, 

T=l  ,-l  rto 

7  =  1 .  . . .  ,  M  f  =  1 . r  -  1 ,  and 

£  fl„(0)  (r/(0)  -F  r/(0)l  <  inv®,  /  =  1 . A. 

/*! 

^  c,(t  -F  1)z/(t)  -  X  5^  a„(T){r/(T)  -F  z/(r))  <  cap^(r)  -  inv,®, 

r-O  r-l  ;-l 

7=1 . A.  /  =  1 . r  -  1 .  and 

-  ^  5„(0)  [z/(0)  -F  z,^(0))  <  cap,(0)  -  inv“,  7=1 . A. 

i-i 

z/(i)  -F  :f(f)  <  7=1 . A,  f  =  0 .  ?■  -  1, 

zf(r)  >  0.  7  =  I . A.  /  =  0 . 1. 

Constraints  L(T)1  and  L(T)2  express  resource  limitations.  Constraints  L(T)3  insure  adequate 
intermediate  product  transfers  occur  to  support  production  activity,  while  conslrainls  L(T)4  insure  that 
inventories  of  intermediate  products  do  not  exceed  capacities.  Finally,  constraints  L(T)5  limit  intensi¬ 
ties  to  nonnegative  values  less  than  intensity  bounds. 

The  set  of  linear  inequalities  L(T)  constitutes  a  continuous  flow  model  of  production,  in  which 
any  positive  intensity  of  activity  operation  supplies  completed  product  to  final  or  intermediate  uses,  or 
to  inventory.  In  the  case  that  intermediate  products  of  a  system  are  large,  discrete  units,  precedence 
relations  occur  between  activities  output  unit  by  output  unit,  and  constraints  L(T)3  and  L(T)4  must  be 
modified.  This  case  will  not  be  treated  here,  and  the  reader  is  referred  to  [6],  in  which  a  dynamic 
activity  analysis  was  developed  on  a  critical  path  analysis  network. 

In  the  case  more  than  one  activity  produces  a  certain  product,  constraints  L(T)3  and  L(T)4  must 
be  modified  for  the  activities  in  question.  See  (6|  and  [8].  However,  with  such  revisions  LIT)  still 
constitutes  a  set  of  linear  inequalities.  For  simplicity  of  exposition,  we  assume  in  what  follows  that  no 
two  activities  produce  the  same  product. 

3.  PRODUCTION  PROGRAMMING 

3.1.  Output  Maximization 

In  this  section,  programs  are  formulated  for  the  maximization  of  value  or  mix  functions  of  final 
output  accumulations.  We  consider  first  the  case  where  a  specific  product  mix  of  final  output  is 
desired,  and  the  problem  is  to  maximize  the  scale  of  this  mix  accumulated  by  a  time  horizon  T. 

Let  Zv*,  be  a  variable  indexing  the  scale  of  the  accumulation  by  time  T.  The  amounts  of  the  vari¬ 
ous  products  will  be  related  by  coefficients 

5,.  V  M  •  '■  =  1 . A. 


L(T)1. 

L(T)2. 

L(T)3. 

L(T)4. 

L(T)5. 
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where  +  i  is  the  amount  of  final  product  from  activity  A,  accumulated.  The  problem  in  question 

is  formulated  as  a  linear  program  as  follows; 

Maximize  z\  +  i 

subject  to 

T-l 

Ol.  a,,^  +  l2^  +  l  <  ^  c,(t  +  l)z/(/).  /■  =  1 . N. 

1-0 

02.  lz/(t).  z/(t)5;:?;  eLiT), 

03.  Z/^  +  i  ^  0. 

In  general,  the  program  involves  [S(N)  +  NK](T)  +  N  +  \  variables,  and  [3(A')  +  NK](T)  +  N 
constraints.  Clearly,  the  time  horizon  (i.e.,  the  number  of  time  periods)  is  the  most  sensitive  factor  in 
terms  of  problem  size  which  can  be  handled.  The  structure  of  the  constraint  set  can  be  modified  by 
rewriting  constraints  L(T)3  and  L(T)4  in  terms  of  intermediate  product  inventory  variables 

linv/(t)l.  /  =  0 . T- 

which  are  the  slack  variables  defined  by  constraints  L(T)3.  Using  these  variables,  we  reformulate  con¬ 
straints  L(T)3  and  L(T)4  as  follows; 

L(T)3.  ^  5„(f)[z/(f)  +  —  C/(r)z/(t  -  1)  —  inv/(t  -  1)  -f  inv/(r)  =  0, 

/-I 

/  =  1 . A',  t  =  1 .  . . .  ,  7"  —  1 ,  and 

£  u„(0)[z/(0)  -I-  z,'^(0)l  -t-  inv/(0)  =  iiivf.  ./  =  1 . .V. 

i-i 

L(T)4.  inv/(r)  ^  cap,(/).  y  =  1 . N,  t  =  0 . f  -  1. 


With  this  revision,  it  is  evident  that  the  constraints  L(T)1,  L(T)4  and  L(T)5  apply  only  time 
period  by  time  period,  and  the  constraint  matrix  exhibits  partial  block  diagonal  structure.  Potential  is 
thus  offered  for  application  of  large-scale  programming  procedures  such  as  decomposition.  (See  [5].) 

We  next  consider  the  case  where  the  value  of  output  produced  is  to  be  maximized.  We  suppose 
each  product  i  has  a  constant  unit  price  p,.  The  maximum  value  of  output  accumulated  from  produc¬ 
tion  activity  during  lO.  D  is  then  given  by  the  optimum  of  the  following  linear  program. 

7-1  ^ 

Maximize  ^  ^  P,c,(t  +  l)z/(r) 

1-0  /-I 

Subject  to  (z/(r),  z/ftll/rf  ,  <  L{T). 

The  remarks  about  problem  size  and  structure  concerning  the  previous  program  apply  here  as  well,  as 
only  the  product  mix  variable  and  the  /V  constraints  0 1  have  been  deleted. 

3.2.  Time  IVfinimization 

In  this  kind  of  planning  problem,  there  are  final  output  demands  which  must  be  met,  but  the 
overall  production  duration  is  to  be  minimized.  Final  output  demands  are  expressed  in  cumulative 
terms  as  follows.  Let 
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U.il).  /  =  1,  2 . T.  i  =  \ . N. 

denote  the  required  cumulative  delivery  of  final  product  /  by  time  t.  Here,  we  are  considering  the  situa¬ 
tion  where  early  delivery  of  final  products  is  acceptable  or  even  desirable.  These  demands  imply  con¬ 
straints 

(2)  £  c,(t-F1)z/(t)  >  /  =  1 . N.  t^\ . T. 

T-O 


We  first  consider  the  problem  of  finding  the  latest  starting  time  for  production  activity  sufficient  to 
satisfy  (2).  A  feasible  production  plan  for  this  problem  would  satisfy  the  linear  inequalities  (2)  and 
L(T).  An  optimal  plan  would  have  the  characteristic  that 

(3)  z,(0)  =  z;(l)  =  z,(2)  =  ••■  =  z,(ro)  =  0.  /  =  1 . N. 

where  to  is  as  large  as  possible.  Such  a  plan  may  be  found  (if  one  exists)  by  solving  a  sequence  of 
Phase  I  linear  programs  (see  [2])  as  follows. 

The  set  of  inequalities  under  consideration  is  of  the  form 

(4)  Az  -I-  Bx  =  b 

z  ^  0,  X  ^  0, 

where 

z  =  (zf(0),  zf(0) . z|(0).  7^(0),  z((l).  zf(l) . z^dl.z^d) . 

2[(T-  1).  zf(r-l) . z^fT-l),  2((7'-1)); 

X  =  (^1 . 

m  =  {N)(T).  n  =  {NK)(T)  -F  3(/V)  (D,  /-  2(A)  (T); 


A  is  the  (m  +  n)  X  I  matrix  of  coefficients  of  activity  intensities  in  (2)  and  L(T),  where  the  first 
m  rows  arise  from  (2); 

B  is  the  (m  +  n)  x  {m  +  n)  matrix  of  coefficients  of  slack  variables  for  said  constraints;  and 
b  is  the  {m  +  n)  vector  of  right  hand  side  constant  terms  of  the  constraints. 

For  the  inequalities  organized  in  this  fashion,  the  solution  algorithm  is  presented  below; 

STEP  0.  Initialize  r  =  T  —  1. 

STEP  1:  Solve  the  Phase  1  problem  with  the  first  (A)(t)  columns  of  the  tableau  corresponding  to 
(4)  deleted.  If  a  feasible  solution  is  found,  stop;  then 

Iz/d),  z/(r)!,'J,"  i-' 

is  an  optimal  production  plan.  If  the  problem  is  infeasible,  go  to  Step  2. 

STEP  2.  If  T  =  0,  stop;  then  the  set  of  inequalities  is  infeasible.  Otherwise,  decrease  t  to  t  —  1 
and  go  to  Step  I 

The  algorithm  is  seen  to  initially  ignore  all  columns  in  the  tableau  corresponding  to  activity  inten¬ 
sities  in  periods  before  time  (  T  -  1),  and  to  then  attempt  to  find  a  basic  solution.  If  none  can  be 
found,  columns  corresponding  to 


VOL  29.  NO  I,  MARC  H  I98Z 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


52 


R  C.  LEACHMAN 


zHT-2).  2/(7"- 2),  /■  =  1 . 

are  also  considered.  The  algorithm  continues  to  allow  the  use  of  columns  corresponding  to  activity 
operation  one  time  period  before  the  earliest  period  of  activity  operations  allowed  by  the  previous  itera¬ 
tion.  The  algorithm  terminates  either  the  first  time  a  feasible  basic  solution  is  found,  or  else  all 
columns  are  adjoined  without  finding  one.  In  the  former  case,  an  optimal  production  plan  is  found,  and 
in  the  latter  case,  the  output  schedule  (2)  is  infeasible  for  the  limitations  L(T). 

We  next  consider  the  problem  of  finding  the  earliest  time  all  product  accumulations  can  be  com¬ 
pleted.  It  is  immediately  apparent  that  an  approach  similar  to  that  considered  above  can  be  used  to 
solve  this  problem.  A  sequence  of  Phase  1  procedures  is  again  suggested,  but  in  this  case  starting  with 
the  possibility  of  positive  activity  intensities  only  during  [0,  1),  and  proceeding  forwards  in  time.  Later 
production  activity  is  allowed  period  by  period  until  either  a  feasible  basic  solution  is  found  for  the  first 
time,  or  else  the  horizon  is  reached  without  finding  one.  In  the  former  case,  an  optimal  production 
plan  is  obtained,  and  in  the  latter  case,  the  output  schedule  is  infeasible  for  the  limitations  imposed. 

3.3,  Cost  Minimization 

in  this  section  we  formulate  the  problem  of  determining  a  minimum  cost  production  plan  which 
meets  a  given  final  output  schedule  expressed  in  the  form  of  (2). 

Nonstorable  resources  are  assumed  to  have  capacity  costs  corresponding  to  the  peak  demands  for 
each  such  resource.  These  resources  cannot  be  accumulated,  so  that  the  production  system  must  have 
the  capability  to  accommodate  peak  loads.  Storable  resources,  however,  have  prices:  these  resources 
account  for  the  variable  cost  of  production  activity.  It  is  assumed  that  storable  resources  can  be  pro¬ 
cured  as  required,  so  that  inventories  of  same  are  ignored.  We  assume  intermediate  product  inven¬ 
tories  also  have  capacity  costs,  corresponding  to  peak  storage  requirements.  These  inventories  will  also 
bear  holding  costs  representing  opportunity  charges  for  unproductive  capital. 

Assuming  linear  capacity  costs,  the  problem  is  formulated  as  a  linear  program  as  follows.  Let 
C'(r)  =  (Cf(/) . ajft)). 

be  the  vector  of  costs  per  unit  capacity  for  nonstorable  resources  maintained  during  [t.  t  -E  1);  let 
CHt)  =  (t) . a(/)). 

be  the  price  vector  for  storable  resources  procured  for  use  during  [t,  t  +  1);  let 
C--(t)  =  (Cf  (t) . Cv(r)). 

be  the  vector  of  costs  per  unit  storage  capacity  maintained  during  [t,  t-E  1)  for  intermediate  products; 
and  let 

H(/)  =  . 

be  the  holding  costs  for  intermediate  products  held  during  [r,  r  -E  1). 

To  serve  as  variables  in  the  minimization,  let 
X  =  (3r, . Tv.s). 

denote  the  peak  requirements  in  any  unit  time  interval  of  nonstorable  resources;  let 
cap  =  (capi . capv). 

denote  the  required  intermediate  product  inventory  storage  capacities;  and  let 
{inv/(r),  z/fr),  z/ltlK'r,®  ;  ;,r' 
he  the  inventory  and  intensity  variables  as  before. 
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For  given  intensity  bounds 
(r,(r)l,^r®;  ,,;,v  ' 

and  initial  intermediate  product  inventories 
(inv®),., . ,v. 

the  minimum  cost  production  plan  meeting  the  output  schedule  (2)  is  given  by  the  optimum  of  the 
linear  program 

7-1  r-i  SK  v 

Minimize  Y,  CHt)  ■  X  +  ^  q(/)fl,*(7)[z/(7)  +  2/(7)] 

f-o  (-0  (t-.VS-H  /-I 

+  C'(/)  •  cap  -F  ^  ^  H,(i)  inv/(7) 

(-0  1-0  ,-i 

subject  to 

Cl.  C,(T+\)zf(T)  >  t/,{7),  /=1 . M  7=1 .  T. 

r-O 

.V 


C2. 

/-I 

[2/(7)  -F  2/(7)]  - 

AT*  <  0.  A'  =  1 ,  . . 

II 

0 

C3 

[2/(7)  +  2,^(7  )1  - 

1)  -  inv/(7-  1)  -F  inv/(7) 

./  =  1. 

....  N.  7=1,  . 

. .  .  T—  1,  and 

|2/(0)  +  2/(0))  4 

-  inv/(0)  =  inv®,  j 

=  1 . ,V. 

C4. 

inv/(7)  - 

cap/7)  <  0.  y  = 

=  1 . N,  7  =  0, 

...  .  7-1. 

C5 

r/(7)  +  2 

/(7)  <  2,(7).,/  = 

0 

11 

..  .  T-\. 

C6. 

X  =  (A',, 

....  >  0. 

cap  =  (capi . capv) 

^  0. 

inv'(7) 

=  (inv/ (7) . 

inv((7))  ^  0,  7  = 

0 .  r-  1. 

2/(7),  2 

/(7)  ^  0.  ./=  1. 

...  ,  N.  7  =  0 . 

.  T-  1. 

Here  constraints  C2  define  the  required  nonstorable  resource  capacities,  and  constraints  C4  define 
the  required  intermediate  product  storage  capacities.  Constraints  C3  and  C5  deal  with  inventory  bal¬ 
ance  and  intensity  bounds  in  the  same  manner  as  the  treatment  of  output  maximization  problems, 
while  constraints  Cl  repeat  (2). 

In  general,  the  program  includes  (4/V  +  NS){T+])  variables  and  (4A'  -f  .WS)  iT)  constraints. 
As  before,  the  fineness  of  the  time  grid  is  the  most  sensitive  factor  in  terms  of  the  problem  size  which 
can  in  practice  be  solved.  A  bordered  angular  configuration  for  the  constraint  matrix  is  now  displayed, 
in  which  constraints  C2,  C4,  and  C5  exhibit  a  block  diagonal  structure  with  coupling  variables. 
Although  this  is  a  more  difficult  structure  than  that  for  the  output  maximization  problems,  nonetheless 
it  can  be  exploited.  See  [5l. 

As  an  alternative  to  the  constant  capacities  for  each  nonstorable  resource  defined  by  constraints 
C2,  one  may  allow  capacities  to  be  adjusted  from  time  period  to  time  period  according  to  linear  costs. 
Many  authors  have  formulated  labor  workforce  levels  in  this  fashion,  allowing  hiring  and  firing  in  each 
period.  See  for  example  (3)  or  [4].  Such  formulations  may  be  integrated  here  as  appropriate. 
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ABSTRACT 

In  a  previous  paper.  Ihe  auihors  have  inlrodueed  a  class  of  mullivariate  llfe- 
limes  (MIFRA)  which  generalize  the  univariate  lifetimes  with  increasing  failure 
rale  average  dl'RA).  They  have  also  shown  that  this  class  satisfies  many  fun¬ 
damental  properties  In  this  paper  it  is  shown  ihai  other  concepts  of  multivari¬ 
ate  IFR  A  do  not  satisfy  all  of  these  properties  Relationships  between  MIFR  A 
and  these  other  concepts  are  given.  Finally,  positive  dependence  implications 
with  respect  to  these  classes  are  also  discussed 


1.  INTRODUCTION 

The  class  of  univariaie  lifeiimes  with  increasing  failure  rate  average  (IFRA)  has  been  of  great 
importance  in  reliability  theory.  The  importance  of  the  class,  and  properties  thereof,  are  discus.sed  in 
the  text  of  Barlow  and  Proschan  III  whose  notation  and  terminology  are  followed  here.  A  recent 
development  with  respect  to  this  class  has  been  the  resolution,  by  Block  and  Savits  12).  of  a  long  stand¬ 
ing  problem  concerning  the  closure  of  this  class  under  convolution. 

Several  recent  papers  by  Block  and  Savits  |3.  4],  and  by  Esary  and  Marshall  |6].  have  proposed 
various  multivariate  extensions  of  this  univariate  class.  It  is  our  purpose  in  the  present  paper  to  give 
the  relations  among  these  various  concepts  and  to  show  that  one  of  these  concepts,  which  was  desig¬ 
nated  MIFRA  in  Block  and  Savits  |3l.  is  preferable  to  others.  This  will  be  done  by  showing  that  among 
these  various  extensions  only  the  MIFRA  class  of  distribution  satisfies  all  of  the  properties  which  one 
would  reasonably  expect  for  a  class  of  multivariate  IFRA  distributions.  Furthermore,  dependence  pro¬ 
perties  and  the  lack  thereof  for  those  classes  are  also  discussed. 

One  deviation  which  we  shall  make  from  the  notation  of  Barlow  and  Proschan  (1|  is  to  call  a 
structure  function  <6(.y)  nwnoione  if  it  is  increasing  in  its  components  and  in  addition  </)(0)  =  0  and 
<A(1)  =  I.  Esary  and  Marshall  (51  have  called  such  a  function  coherent.  We  conform  to  the  terminol¬ 
ogy  of  Barlow  and  Proschan  [II.  and  call  a  structure  function  cohcivni  (called  fully  coherent  b\  Esarv 
and  Marshall)  if  it  is  increasing  in  its  arguments  and  if  all  components  arc  essential.  The  life  function  r 
corresponding  to  a  system  <6  is  called  monoione  (coherent)  if  <b  is  monotone  (coherent)  See  Esary  and 
Marshall  [5|  for  a  discussion  of  life  functions. 

2.  MULTIVARIATE  IFRA 

Block  and  Savits  (31  have  introduced  a  concept  of  multivariate  IFRA  which  is  given  in  the  follow¬ 
ing  definition. 


'this  resOiircIi  was  suppiirlcd  by  ONR  C'onlravl  NIH)OI4-7ri-C  -()X.W  aiul  by  NM  (irani  \t(  .'77.()|45g 
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DEFINITION  1.  Let  T  =  (Ti.  ....  T„,)  be  a  nonnegalive  random  vector.  The  vector  T  is  said 
to  be  Vf/FRA  iff 

£•"  [/)(!)]  ^  £[//“(I/a)l 

for  all  continuous  nonnegative  increasing  functions  h  and  all  0  <  a  ^  1. 

The  above  definition  can  be  better  appreciated  by  first  examining  a  special  case.  If  we  let 
h(T)  =  Ia{T)  where  A  =  (/],  «)  x  (rj,  oo)  x  ...  x  (t„,  «>)  and  I  is  the  indj^cator  function  of  the  set 
A  (i.e.  Ia^Z^  ~  1  T  ^  A),  the  defining  condition  reduces  to  /^O)  <  FiaF)  for  all  0  <  a  ^  1 
and  all  0  <  T  where  FU)  =  P(T  >  t).  This  is  a  simple  multivariate  analog  of  the  usual  univariate 
definition  of  IFRA  (see  Remark  on  p.  84  of  (11)  and  is  given  in  A  below.  Furthermore,  monotone  sys¬ 
tems  have  lifetimes  of  the  form  ti(T)  =  max  min  T,  and  tjCT)  =  min  max  T.  where  P,  are  min 

~  i€Ar, 

path  sets  and  the  A,  min  cut  sets.  These  functions  have  the  property  that  riaT)  =  aT(T)  for  every 
0  <  a  <  I  and  are  called  homogeneous  functions.  Marshall  and  Shaked  [8]  have  pointed  out  that 
Definition  1  is  equivalent  to  g(.T)  being  IFRA  for  every  increasing  homogeneous  function  g.  This 
implies  that  if  Tis  MIFRA  then  every  monotone  system  formed  using  Tis  IFRA. 

Several  other  possible  conditions  for  multivariate  IFRA  have  been  proposed. 

DEFINITION  2.  Let  T  =  (T) . T„,)  be  a  nonnegative  random  vector  with  survival  function 

FU)  =  P(T  >  U.  The  vector  T  is  said  to  satisfy  condition  ;  if  the  condition  following  /  is  satisfied 
where  i  -  A,  B,  C,  X,  D,  E,  or  F. 

A\  P'U)  ^  F(aj_)  for  all  0  <  a  <  1  and  all  0  <  j. 

fl  :  r  is  such  that  each  monotone  system  formed  from  T  is  univariate  IFRA. 

C  :  T  is  such  that  there  exist  independent  IFRA  random  variables  Tj . A'^  and  monotone 

life  functions  r,,  ;  =  1 . m  such  that  Ti  =  t ,{X\ . for  /  =  1 . m. 

X:  T  is  such  that  there  exist  independent  IFRA  random  variables  T, . and  nonempty 

sets  5,  of  1 1 . A'l  such  that  T,  =  ^  AT,  for  /  =  1 . m. 

D  :  T  is  such  that  there  exist  independent  IFRA  random  variables  T, . X^  and  nonempty 

subsets  5,  of  ( 1 . A.  )  such  that  r  =  min  AT,  for  ;  =  1 . ni. 

liS,  ' 

f  :  Jis  such  that  the  minimum  of  any  subfamily  of  Ti . T„,  is  IFRA. 

F .  Tis  such  that  min  a,T,  is  IFRA  for  all  a,  >  0,  /  —  1 . m. 

I 

Conditions  A,  B,  C,  D,  E,  Fhave  been  given  by  Esary  and  Marshall  [6]  and  condition  X  was  given 
by  Block  and  Savits  14] . 
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3.  RELATIONSHIPS  AMONG  THE  CONDITIONS 

The  following  relationships  hold  between  MIFRA  and  the  seven  conditions  given  in  Section  2. 


{I  % 

A(~‘)F  B 

%  ^ 

E 

Figure  I .  Relationships  among 
the  conditions 


With  the  exception  of  the  implication  I  ^  C,  the  above  figure  is  complete,  i.e.  no  more  implica¬ 
tions  are  possible.  It  is  not  known  where  I  C  holds,  but  we  conjecture  that  it  does  not.  Proofs  of 
the  remaining  implications  and  counterexamples  will  now  be  given. 

Because  of  results  in  Esary  and  Marshall  [6l  we  need  only  show  how  MIFRA  and  I  compare  with 
concepts  A,  B,  C,  D,  E  and  with  each  other. 

3.1  Comparison  of  MIFRA  with  Other  Conditions 

a.  C  ^  MIFRA.  This  follows  from  (iii)  of  Theorem  4.1  of  Block  and  Savits  [3]. 

b.  I  =■»  MIFRA.  See  (iv)  of  Theorem  4.1,  [3]. 

c.  MIFRA  F.  Apply  (PI)  and  (P5)  of  Theorem  2.3,  [3]. 

d.  MIFRA  B.  This  is  (PI)  of  Theorem  2.3,  (3). 

e.  MIFRA  I.  Given  in  Example  3.3  of  Block  and  Savits  [4]. 

f.  MIFRA  >  C.  (and  MIFRA  >  D).  Example  3.2,  [4]. 

g.  A  ^  MIFRA.  Since  A  ^  B. 

h.  B  ^  MIFRA.  Since  B  A. 
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3.2  Comparison  of  I  with  Other  Conditions 

a.  Dj^JL.  Example  3.3  of  Block  and  Savits  [4], 

b.  C  >  I.  Since  D  ^  1 

c.  I  ^  D,  Lei  -Vi  .3'2.  A',  be  absolutely  continuous  IFRA  random  variables  Form 

Fi  =  .A'l  +  .Yi  and  fS  ^  -^2  ■*"  definition  (  F|.  I  j)  satisfies  I,  but  by  Section  10  of 

Esary  and  Marshall  [6]  (F,.  f  ,)  does  not  satisfy  D. 

All  other  counterexamples  and  implications,  with  the  exception  of  I  C.  follow  from  the  above 

4.  PROPERTIES  RELEVANT  TO  MULTIVARIATE  IFRA  DISTRIBUTIONS 

The  class  C  of  MIFRA  distributions  has  been  shown  by  Block  and  Savits  (3)  to  satisfy  the  follow¬ 
ing  properties: 

(Property  1):  Closure  under  the  formation  of  monotone  systems,  i.e.  if  (Ti . T„)  i  C  and 

ri . T„,  are  monotone  life  functions,  then  (TifTi . T„) . . T„))  6  C. 

(Properly  2):  Closure  under  limits  in  distribution. 

(Properly  3):  Marginals  are  in  the  same  class. 

(Property  4):  Closure  under  conjunction  of  independent  sets  of  lifetimes,  i.e.  if  (Ti . T, ) 

and  (.S’l . 5,„)  €  C  and  are  independent,  then  (T; . T„,  S’, . S,„)  €  C. 

(Property  5);  Closure  under  scaling,  i.e.  if  (T, . T",)  €  C and  o,,  /  =  1 . n,  are  nonnega¬ 
tive  constants,  then  ...  ,  a„T„)  €  C. 

(Property  6):  C  is  closed  under  well  defined  convolution,  i.e.  if  (T, .  T„)  €  C  and 

(S| . S„)  €  C  and  independent,  then  iT^  H-  S, .  T„  +  S„)  €  C. 

It  is  reasonable  that  any  class  of  multivariate  IFRA  distributions  should  satisfy  these  conditions. 
Block  and  Savits  [3)  have  shown  that  the  MIFRA  distributions  satisfy  these  conditions.  We  will  now 
show  that  each  of  the  conditions  A,  B,  C.  I,  D,  E,  F  fails  to  satisfy  at  least  one  of  these  properties. 

4.1  A  Does  Not  Satisfy  Property  I. 

This  follows  since  A  ^  B. 

4.2  A  Does  Not  Satisfy  Property  5. 

This  follows  since  B  ^  A. 

4.3  C  (and  Dt  Do  Not  Satisfy  Property  5. 

Let  (r,,  T i)  =  (min(A',  Z),  min(F.  Z))  where  A",  I' and  Z  are  independent  exponential  random 

variables  with  mean  one.  For  a,  ^  r/j,  assume  (o|F],  <72^2^=  ^’■|('^i . Af^),  r2(T| . V^)) 

where  r,.  12  monotone  life  functions  and  A", . A'^  are  independent  iFRA  lifetimes.  It  follows 

from  Remark  2.2a  of  Block  and  Savits  (41  that  there  exist  independent  exponential  random  variables  L', 
(  ,  W  such  that  (r/|7|,  ctiT^)  =  (min(t/.  IF),  min  ( F.  IF)).  But  the  conditions  0=  F(«F|  =  oiT^) 
and  A’(min((y,  IF)  =  min(I',  IF))  >  0  are  not  compatible. 
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4.4  I  Does  Not  Satisfy  Property  I. 

Lei  .V  ynd  L  be  independeni  exponential  lifetimes.  Define  T\iX.  Y)  =  min(,\',  >)  and 
ttI.V,  Y)  =  Y  and  assume  that  (t|.  rj)  =  iU  +  W,  Y  +  W)  where  U,  i’,  and  W  are  independeni 

IFRA  lifetimes.  Now  by  Theorem  2.8  of  Block  and  Savits  [4],  one  of  Land  W  is  exponential  and  one 

is  concentrated  at  0.  If  W  is  exponential  since  PlminlA",  }')  ^  K)  =  1,  it  follows  that  Pit  =  0)  =  1 
and  so  PiminiA'.  Y)  =  K)  =  I  which  is  impossible  if  X  and  Tare  independent  exponentials.  If  I  is 
exponential,  then  PiW  =  0)  =  1  so  that  mini  AT.  Y)  and  Y  are  independeni,  again  an  impossibility 

4.5  DDoes  Not  Satisfy  Property  6. 

Let  .V,  >■,  and  Z  be  independeni  absolutely  continuous  IFRA  lifetimes.  Then  both  i.V.  >  )  and 
iZ,  Z)  are  trivially  in  D.  However,  if  (X.  Y)  +  (Z,  Z)  =  (X  +  Z.  Y  +  Z)  was  in  D.  then  by  Section 

10  of  Esary  and  Marshall  (6|  A”  +  Z  and  Y  +  Z  would  be  independeni,  but  they  can't  be. 

4.6  fDocs  Not  Satisfy  Property  I. 

This  follows  since  E  B. 

4.7  PDoes  Not  Satisfy  Property  1. 

This  follows  since  A  F. 

5.  POSITIVE  DEPENDENCE 

The  first  published  definition  of  a  class  of  multivariate  nonparametric  reliability  distributions  was 
Harris'  [7|  definition  for  multivariate  increasing  hazard  rate.  This  definition  included  a  type  of  positive 
dependence  (i.e,  right  corner  set  increasing).  See  Barlow  and  Proschan  [1]  for  a  discussion  of  \arious 
types  of  positive  dependence.  Subsequent  definitions  have  not  included  such  assumptions.  The  opin¬ 
ion  which  is  now  generally  held  is  that  the  various  concepts  of  positive  dependence  are  not  intimately 
related  to  useful  definitions  for  nonparametric  multivariate  life  classes.  In  other  words,  if  a  multivariate 
lifetime  has  an  increasing  failure  rate  or  failure  rate  average,  then  it  need  not  follow  that  the  lifetime  be 
positively  dependent  in  some  sense.  In  fact,  if  such  a  definition  implies  positive  dependence,  then  it  is 
probably  too  strong.  Examples  of  such  definitions  are  conditions  C.  D  and  I  which  are  easily  show  n  to 
imply  association.  We  will  show  that  the  more  useful  definitions  A  (=  F),  B.  E  and  especially  MIFR.A 
do  not  imply  even  positive  quadrant  dependence,  which  is  one  of  the  weaker  types  of  positi\e  depen¬ 
dence. 

5.1  A  and  E  Positive  Quadrant  Dependence 

C  learly,  Fiff.  tj)  =  P(Tf  >  /,,  T;  >  r2*  =  exp(-ti-r2~'i'2^  satisfies  A  and  E.  but  F^l^.  h)  ^ 
Pi  7  I  >  /,)P(7'2  >  /2> 

5.2  MIFRA  and  B  Positive  Quadrant  Dependence 

Consider  (T,.  Ti)  -  ( C/,  1  -  U)  where  1/ is  a  uniform  distribution  on  the  unit  interval.  Clearly, 
F(i\,  ti)  <  P{T\>  i\)P(Ti  >  I2),  but  T],  T2,  minfFi.  T2)  and  maxITi,  T2)  have  a  univariate  uni¬ 
form  distribution  and  so  are  IFRA.  Thus,  B  is  satisfied.  Furthermore,  Theorem  3.5  of  Block  and  Sav¬ 
its  13|  gives  that  (T,,  T2)  is  MIFRA  if  the  indicator  function  of  every  fundamental  upper  domain  in 
R2*  satisfies  the  inequality  of  Definition  1.  A  set  /I  is  a  fundamental  upper  domain  if  it  has  the  form  in 
Figure  2  below  where  0  <  jr,  <  X2  ^  ^  and  Vi  >  ,V2  >  ■  •  >  .v„  >  0.  From  Figure  2, 
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((7-,,  Tj)  €  ^)  =  u  ir,  >  X,.  7-j  >  .^’,1  =  y  U,  <  t/  <  1  -  >-,) 

1=1  1-1 


((r,/a.  Tj/rt)  €  =  y  {ax,  <  O'  <  I  -  a.v,). 

—  I 

Let  /  =  (/:  X,  +  .V,  <  1),  J  =  (,/:  ax,  +  a>',  <  I).  Since  0  <  a  ^  1,  /  C  /  Then 

f(/.(7-i.  =  ^  [y  I'''  <  ^  <  '“>'*] 


£'/“[/;■  (7, /a,  r:/a)l  =  7''“  jy  (ax,  <  O  <  1  -  a>',lj 
^  P'"*  j  y  lax,  <  0  <  1  -  a,v,ij. 


By  renumbering,  if  necessary,  we  may  assume  without  loss  of  generality  that  /  =  {1,  2 . /)).  Now 

define  Af  =  {2  <  A  </?:  ax*  ^  I  -  a.t'*  |l  =  U(  <  <  . . .  <  A,)  and  set  Aq  =  1  and  A>+,  =  /;  +  1. 

Then 

y  lax,  <  0  <  I  -  ay,!  »  "y  laA*,  ,  <  0  <  1  -  ay*,_,| 

i(/  i-i 

and  these  latter  sets  are  disjoint  intervals.  It  follows  from  Minkowski’s  inequality  for  0  <  a  <  1  that 

(r+l 

P'^“l  y  (ax,  <  U  <  I  -  ay", 11  -  PloXn^  I  <  O  <  1  -  ay"*,-,! 

t(i  |/-i 

>  X  ,  <  t/  <  1  -  a>*,_,l. 

(-1 
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Since 

(j  (v,  <  iy  <  1  -  >’,!  =  ij  y  \x,  <  u  <  \  -  L,), 

lil  /»!  k, 

it  suffices  to  show  that  for  /  =  1 . r  +  I 

P  U  [x,<u<\-  T,i|  ^  P'l"  iaA*,  U  <\-  |1- 

A,  |$,«A,-1 

But  the  union  on  the  left  hand  side  is  contained  in  the  interval  ^  <  U  <  \  —  and 

Xki  I  +  >'a,-i  <  1  since  for  /c,_,  <  /  <  A:,  —  1,  /  €  /.  And  so  we  are  done  if  we  show  that 
1  —  r  <  (1  —  ar)'^"  for  0  <  r  <  1,  0  <  a  <  1.  This  last  inequality  is,  however,  easily  verified. 
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ABSTRACT 

In  a  1973  paper  J.  D.  Esary,  A.  W.  Marshall,  and  F.  Proschan  [51  con¬ 
sidered  a  shock  model  giving  rise  to  various  nonparameiric  classes  of  life  distri¬ 
butions  of  interest  in  reliability  theory.  A  number  of  authors  have  extended 
these  results  in  a  variety  of  directions.  In  this  paper,  alternative  proofs  of  the 
increasing  failure  rate  (IFR)  and  decreasing  mean  residual  life  (DMRL)  results 
are  given  which  do  not  make  use  of  the  theory  of  total  positivity.  Some  bivari¬ 
ate  extensions  are  then  obtained  using  a  shock  model  similar  to  that  originally 
used  by  H.  W  Block,  A.  S.  Paulson,  and  R.  C.  Kohberger  (21  to  unify  various 
bivariate  exponential  distributions. 


INTRODUCTION 

In  a  1973  paper,  Esary,  Marshall,  and  Proschan  [5]  investigated  a  shock  model  which  gives  rise  to 
various  nonparameiric  classes  of  life  distributions  which  are  of  interest  in  reliability  theory.  We  will 
begin  this  paper  by  reviewing  these  univariate  life  distribution  classes  and  then  stating  the  theorem  due 
to  Esary,  Marshall,  and  Proschan.  We  then  provide  new  proofs  to  some  parts  of  this  theorem.  The 
proof  uses  elementary  methods  rather  than  results  from  the  theory  of  total  positivity.  We  will  then 
describe  a  bivariate  shock  model  which  has  been  used  by  Block,  Paulson,  and  Kohberger  (2)  to  unify 
some  of  the  various  bivariate  exponential  distributions  which  have  appeared  in  the  reliability  literature. 
Finally,  we  will  discuss  a  number  of  bivariate  shock  models  yielding  bivariate  conditions  that  can  be 
viewed  as  generalizations  of  the  univariate  new  better  than  used  (NBU)  condition. 

1.  UNIVARIATE  LIFE  DISTRIBUTION  CLASSES 


In  this  section,  we  will  consider  various  models  of  stochastic  aging.  For  every  model  of  wearout 
(or  adverse  stochastic  aging)  there  is  an  analogous  model  of  improvement  (or  beneficial  stochastic 
aging).  Throughout,  we  will  use  the  word  "increasing"  ("decreasing")  to  mean  "nondecreasing"  ("nonin¬ 
creasing"). 

Consider  a  single  component  or  system  with  a  random  lifetime  denoted  by  the  random  variable  T. 
We  will  denote  the  distribution  function  of  T  by  F.  The  survival  function,  denoted  by  F,  is 
F(r)  PiT  >  r)  =  1  -  F(i).  We  will  assume  throughout  that  F(O-)  =  1  (i.e.,  that  7" is  a  nonnega¬ 
tive  random  variable).  We  now  define  the  various  nonparametric  life  distribution  classes  of  interest. 


If  a  life  distribution  F  satisfies 


(1.1) 


F(i  +  x) 
Fit) 


is  decreasing  in  -«>  <  r  <  oo  for  fixed  .x  >  0 


then  F  \s  called  an  increasing  failure  rate  (IFR)  distribution. 
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A  distribution  F  is  called  an  increasing  failure  rate  average  (IFRA)  distribution  if 

(1.2)  f(ar)  >  F*(/)  for  0  <  a  <  I.  /  >  0. 

A  distribution  F  is  called  a  new  better  than  used  (NBU)  distribution  if 

(1.3)  Fix  +>')  <  F(x)F{y)  for  jr  >  0,  y  ^  0. 

A  distribution  F  with  a  finite  mean /x  is  called  a  new  better  than  used  in  expectation  (NBUE)  dis¬ 
tribution  if 

(1.4)  J  F(x)dx  <  ixFir)  for  i  >  0. 

A  distribution  Fwith  finite  mean  is  called  a  decreasing  mean  residual  life  (DMRL)  distribution  if 

1  r°°  - 

(1.5)  ^ —  I  /"(xlalx  is  decreasing  in  r. 

For  eacii  of  the  preceding  classes  of  life  distributions,  there  is  a  corresponding  class  which  is  a 
model  of  beneficial  stochastic  aging.  These  are  obtained  from  the  preceding  classes  by  replacing 
decreasing  with  increasing,  increasing  with  decreasing,  and  reversing  inequalities.  This  results  in  classes 
known  as  the  decreasing  failure  rate  (DFR)  class,  the  decreasing  failure  rate  average  (DFRA)  class,  the 
new  worse  than  used  (NWU)  class,  the  new  worse  than  used  in  expectation  (NWUE)  class,  and  the 
increasing  mean  residual  life  (IMRE)  class.  We  next  discuss  some  alternate  characterizations  which 
perhaps  supply  more  intuition. 

We  begin  by  considering  again  a  random  variable  T  with  distribution  function  F  denoting  the  ran¬ 
dom  lifelength  of  some  device.  For  any  x  ^  0,  we  will  consider  the  survival  function 

(1.7) 

Fix) 

Intuitively,  F,  represents  the  survival  function  of  the  residual  life  of  the  original  device  condi¬ 
tioned  on  the  fact  that  it  survived  past  time  x.  We  next  prove  a  lemma  and  a  number  of  propositions 
which  give  alternate  characterizations  of  most  of  these  classes. 

LEMMA  1: 

f^y  =  fory  ^  X  ^  0 

PROOF:  Omitted. 

PROPOSITION  1:  Let^ijr  and /a  be  the  means  of  F,  and  F,  respectively. 

(i)  Fhas  an  IFR  distribution  if  and  only  if  F^(z)  ^  F,  (r)  for  all  z  when  .v  >  y. 

(ii)  F  has  an  NBU  distribution  if  and  only  if  F^i:)  <  Fi:)  for  all  z  when  .v  ^  0. 

Let  M  be  finite 

(iii)  F  is  a  DMRL  distribution  if  and  only  if  Mv  ^  Mi  fot  ^  ^ 

(iv)  Fis  an  NBUE  distribution  if  and  only  if/x,  <  Ix  for  .v  ^  0. 
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PROOF:  Omitted. 

PROPOSITION  2: 


(i)  F  is  an  IFR  distribution  if  and  only  if  is  an  NBU  distribution  for  all  y. 

(ii)  Let  F  have  a  finite  mean.  Then  F  has  a  DMRL  distribution  if  and  only  if  F,  has  an 
NBUE  distribution  for  ally. 

PROOF:  The  proof  of  (i)  follows  directly  from  Proposition  l(i),  Lemma  1,  and  Proposition  l(ii). 
The  proof  of  (ii)  follows  from  Proposition  l(iii).  Lemma  1,  and  Proposition  l(iv). 

2.  THE  EMP  SHOCK  MODEL  AND  THEOREM 

In  this  section,  we  will  discuss  the  Esary,  Marshall,  and  Proschan  (EMP)  shock  model  and 
theorem.  We  will  provide  alternate  proofs  to  some  of  these  results  and  indicate  some  generalizations. 

We  begin  by  defining  various  nonparametric  classes  of  discrete  distributions  for  positive  integer 
valued  random  variables. 


Let  M  be  a  positive  integer  valued  random  variable  putting  no  mass  at  infinity.  Then  we  shall  say 

that 


(2.1) 

(2.2) 

(2.3) 

(2.4) 

(2.5) 


A/  is  a  discrete  IFR  random  variable  if  (P{M  >  k)/P(Af  >  k  -  1)  is  decreasing  in 
k  =  1,2 . 

M  is  a  discrete  IFRA  random  variable  if  {PiM  >  k))'^*  is  decreasing  in  k  =  1,  2 . 

M  is  a  discrete  NBU  random  variable  if  PiM  >  k  +  j)  ^  PiM  >  k)PiM  >  j)  for 
y,  k  =  0.  1,2 . 


M  is  a  discrete  NBUE  random  variable  if  PiM  >  k)  ^  PiM  >  y)  ^  ^  PiM  >  j) 

j-O  j'^k 

for  k  =  0, 1, 2,  ...  and  M  has  a  finite  mean. 


Mis  a  discrete  DMRL  random  variable  if  [PiM  >  k))~' 
in  k  =  0, 1, 2,  ...  and  M  has  a  finite  mean. 


IX  F(M  >  j) 

ij-* 


is  decreasing 


Note  that  these  classes  of  life  distributions  are  simply  discrete  analogues  of  the  classes  that  we 
have  previously  defined  in  Section  1.  If  we  reverse  the  inequalities  and  change  decreasing  to  increasing 
we  can  define  analogous  classes  of  discrete  DFR,  DFRA,  NWU,  NWUE,  and  IMRL  distributions 
(except  that  we  do  not  require  M  to  have  a  finite  mean  in  the  NWUE  and  IMRL  cases). 

Next  we  describe  the  fundamental  shock  model  discussed  by  Esary,  Marshall,  and  Proschan. 
Consider  a  device  subjected  to  shocks  which  occur  in  time  as  events  of  a  Poisson  process  with  intensity 
X.  The  device  fails  on  the  Mth  shock  where  M  is  a  positive  integer  valued  random  variable  which  is 

M 

independent  of  the  shock  process.  Letting  T  be  the  lifelength  of  the  device,  we  see  that  T  =  X 

where  the  X,'s  are  independent  and  identically  distributed  random  variables  having  an  exponential 
distribution  with  parameter  X.  The  survival  function  is  of  the  form 
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/■(/)  =  P{T  >  /)  =  £  PiSit)  =  k)PiM  >  k) 
*-0 

where  {5(/):  t  ^  0)  is  a  Poisson  Process  with  intensity  X. 

We  now  state  the  main  theorem. 


THEOREM  1:  (Esary,  Marshall,  and  Proschan): 

(i)  If  M  is  discrete  IFR  (DFR),  then  Fis  IFR  (DFR) 

(ii)  If  M  is  discrete  IFRA  (DFRA),  then  Fis  IFRA  (DFRA) 

(iii)  If  M  is  discrete  NBU  (NWU),  then  F  is  NBU  (NWU) 

(iv)  If  M  is  discrete  NBUE  (NWUE),  then  F  is  NBUE  (NWUE) 

(v)  If  M  is  discrete  DMRL  (IMRL),  then  F  is  DMRL  (IMRL). 

Elementary  proofs  of  parts  (iii)  and  (iv)  were  given  in  Esary,  Marshall,  and  Proschan.  The  proofs 
of  the  remaining  parts  required  results  from  the  theory  of  total  positivity.  We  now  give  proofs  of  (i) 
and  (v)  which  involve  elementary  methods.  We  repeat  the  proof  of  (iii)  to  illustrate  some  techniques 
which  we  will  utilize.  Furthermore,  in  the  subsequent  section,  we  use  the  techniques  developed  to 
establish  new  bivariate  results. 


PROOF  OF  THEOREM  1  (iii):  We  note  that 


F(r  +  jf)  =  £  P(SU  +.v)  =  k)P{M  >  k). 

k-O 

Since  (S(/);  t  >  0l  is  assumed  to  be  a  Poisson  process,  we  have  by  the  stationary  independent  incre¬ 
ments  property  that 

F(f  +  x)  =  £  £  F(S(f)  =  A)F(S(x)  =  )c  -  A)F(M  >  A  +  Oc  -  A)) 

*=o  a=o 

=  Z  Z  =  A)F(5(x)  =  k  -  A)F(A/  >  A  -I-  (X  -  A)) 

a-o 

=  £  £  (F(S(f)  =  A)F(S(x)  =  l)P(M  >  A  -F  /). 
a-o  /-o 


Applying  the  discrete  NBU  condition  on  A/ we  have  that 


F(/  +  x)  <  £  £  PiSU)  =  A)P(S(x)  =  /)P(M  >  A)P(M  >  I) 
a-o  1-0 


=  F(S(r)  =  A)F(A/ >  A) 
a-o 


2^  F(S(x)  =  DPiM  >  /) 
1-0 


Consequently,  Fit  -F  x)  <  Fii)Fix). 


PROOF  OF  THEOREM  1  (i):  Consider  a  random  variable  T  whose  life  distribution  arises  from  a  shock 
model  in  which  M  is  discrete  IFR.  In  order  to  show  that  it  is  IFR  It  suffices  from  Proposition  2  that  F, 
is  NBU  for  all  y.  However,  in  light  of  the  preceding  proof,  we  need  only  show  that  F,  arises  from  a 
shock  model  in  which  the  random  number  of  shocks  survived  has  a  discrete  NBU  distribution. 
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Now 


T  /'(SO' +  w  =  A)/»(Af  >  A) 

^ - 

F(y)  '£P(S(y)=  j)P(M  >  J) 

J-O 

£  ^  PiSiy)  =  j.  Sy(u)  =  A  -  j)P(M  >  7  +  (A  -  7)) 
_  A-O  y-0 _  _ 


Y_^P(S{y)  =  j)P(M  >  J) 
>-o 


where  S„(u)  =  5(7'  +  u)  —  S(y) 


00  00 


£  =  =  +  (A -7)) 

7-0  A-j _ 

£/»(50')  =  »/’(A/  >  7) 

j-Q 


00  00 


X  X  /’(SO-)  =  7)/’(5,(m)  =  k)P(M  >  J  +  k) 

_  ^-0  *-Q _ 

£  P(S O' )  =  /)/* (A/  >  7) 

7-0 

£/'(S0')  =  ;)/’(A/ >7  +  A:) 
=  J  /'(S,(i/)  =  k)  ^ - 


*-o 


Y^PiSiy)^  j)P(M  >7) 

7-0 


Now  the  expression 


2;  PiSiy)  =  j)PiM  >  j  +  k) 

(2.6)  ^ - 

J  PiSiy)  =  7)  (/’(A/  >  7) 

7-0 

is  a  function  of  /c  which  has  the  value  1  when  A:  =  0  and  is  a  decreasing  function  of  k  since 
PiM  >  j  +  k)  decreases  in  k  for  fixed  7.  In  addition,  since  M  is  discrete  IFR,  it  is  also  discrete  NBU 
and  hence  PiM  >  j  +  k)  ^  PiM  >  j)PiM  >  k).  Thus,  the  expression  in  equation  2.6  is  bounded 
between  0  and  PiM  >  k).  However,  k  ,  PiM  >  k)  0.  Consequently,  the  expression  in 
equation  2.6  is  a  survival  function  of  a  discrete  random  variable. 


Next  we  let  M*  be  a  discrete  random  variable_  which  is  independent  of  the  Poisson  process 
{Sj  (/):  t  ^  0l  which  has  this  survival  function.  Then  Fyiu)  is  of  the  form 

Fyiu)  =  2;  PiSyiu)  =  k)PiM*  >  k). 

k-O 

To  complete  the  proof  we  need  only  show  that  since  M  is  discrete  IFR,  M*  must  be  discrete  NBU. 


The  condition  that  must  be  checked  is 

PiM*  >*:  +  /)<  PiM*  >  k)PiM*  >  I). 
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This  is  equivalent  to 

^P(S(y)  =  j)P(M  >  j  +  k+l)  >./  +  A)  £  P(S(y)  =  j)P(M  >  j  +  D 

7=0  ^  7-0  7^0 

~  “  " 

^P{S(y)  ^  j)P(M  >  j)  ^P{S(y)  =  j)PiM  >  j)  ^  P(S{y)  =  i)P(M  >  j) 

7-0  7-0  7-0 

or  after  some  algebra  to 

^^(P(M  >  j  +  k  +  l)P(M  ■>  1)-  P(M  >  j  +  k)P(M  >  1  +  t))P(S  O' )  =  S)P(S(y  )  =  ,/)^0. 

A-Oy-O 

Since  for  A  <  0  or  j  <  0,  P(S(y)  =  ^)P(S(y)  =  ,/)  =  0,  we  may  rewrite  the  above  condition  as 

OO  CO 

£  2)  {P(M>  j  +  k  +  l)P(M  >  \)~  P{M>  J  +  k)P(M  >  A  + /))P(S(>)  =  A)P(50')=  J)  <  0. 

A*  — OO  J^  —  OQ 


By  breaking  up  this  double  summation  into  three  parts:  over  A  <  j  +  k,  S  =  j  +  k,  and 
A  >  7  +  A,  realizing  tht  the  middle  term  of  the  preceding  sum  is  zero;  letting  A'=  7  + A  and 
7'  =  A  -  A  so  that  7  =  A'  -  A  and  A  =  7'  +  A  (with  the  consequence  that  the  condition  A  >  7  +  A  is 
equivalent  with  7'  +  A  >  A')  and  rearranging,  we  then  find  that  the  condition  we  are  checking  can  be 
rewritten  as 


£  '^(P(M  >  j  +  k  +  t)P(M  > \)~  P(M  >  j  +  k)P{M  >  \  +  l))P(S(y)=‘\)P(S{y)  =  j) 

i<j*k 

£((P(A/>  j  +  k  +  l)P(M>A)-P{M>  j  +  k)PiM>  S  +  D) 

A<7  +  * 

P(S{y)  =  7  +  k)P(S(y)  =  A  -  A)K  0. 


or 


II  [(P(M  >  J  +  k  +  DPiM  >  A)  -  PiM  >  j  +  k)P(M  >  ik  +  D) 

A<.'  +  k 

(P(S(y)  =  A)P(S(y)  =  J)  -  P(S(.y)  =  j  + k)PiS(y)  =  k))]  ^0. 

Noticing  that 


P(M  >  7  +  A  +  DPiM  >  A)  -  P{M  >  J  +  k)P{M  >1  +  1) 


P(M  >  J  +  k  +  l) 
(P(M  >  7  +  A) 


P(M  >1  +  1) 
P(M  >  A) 


P(M  >  7  +  k)P(M  >  A)  <  0 


since  M  is  discrete  IFR  and  A  <  7  +  A,  we  see  that  it  now  suffices  to  show  only  that 
(2.7)  P(Siy)  =  A)/’(S(7^)  =  7)  -  P(S)y)  =  7  +  k)PiS(y)  =  A  -  A)  ^  0. 


We  may  take  care  of  a  number  of  cases  rather  easily.  If  A  <  0,  then  A  -  A  <  0  and  the  left  hand  side 
of  (2,7)  is  0  —  0  =  0,  If  A  ^  0  and  7  <  0,  then  A  —  A  <0  since  A  <  7  +  A.  Thus,  the  left  hand  side 
of  (2.7)  is  0  —  0  =  0.  If  0  ^  A  <  A  and  7  >  0,  then  the  expression  on  the  left  hand  side  of  (2.7) 
reduces  to  the  nonegative  quantity  P(S{y)=  A)P(S(y)=>  j).  Finally,  we  consider  the  case  where 
A  ^  A  and  7  ^  0.  First,  note  that  if  A  ^  A  and  A  <  7  +  A,  then  7  0.  If  A  =  0  then  (2.7)  is 

immediate,  thus  we  consider  A  >  0.  The  left  hand  side  of  (2.7)  is 


e  *^(\>')‘^  e  ^^{\yV  _  e  ^''(\y)’*'‘  e  ^‘(Xy)‘^~* 

A!  7!  (./+A)!  (A- A)! 

„-2Av7>  >i  +  / 

^  \T7~T~  cr  +  A)(7  +  A  -  1)  ...  (/  +  1)  -  A(A  -  1)  ...  (A  -  A  +  1)) 

A !  (7  +  A ) ! 

which  is  clearly  nonnegative  if  A  <  7  +  A.  This  completes  the  proof. 
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Next,  for  the  proof  of  Theorem  1  (iv),  we  refer  the  reader  to  the  paper  of  Esary,  Marshall,  and 
Proschan.  We  now  show  that  (v)  of  Theorem  1  follows  from  (iv)  in  much  the  same  fashion  that  (i) 
followed  from  (iii). 


PROOF  OF  THEOREM  1  (v);  Consider  a  random  variable  T  whose  life  distribution  arises  from  a 
shock  model  where  M  is  discrete  DMRL.  In  order  to  show  that  it  is  DMRL,  it  suffices  from  Proposi¬ 
tion  2  to  show  that  Fy  is  an  NBUE  distribution  for  ail  y.  However,  in  light  of  (iv),  we  need  only  show 
that  Fy  arises  from  a  shock  model  in  which  the  number  of  shocks  survived  has  a  discrete  NBUE  distri¬ 
bution.  Now  as  before  it  can  be  shown  that 


Fy(u) 


I 

*-o 


PiSyiu)  =  k) 


£  PiSiy)  =  j)P{M>j  k) 
y-O _ _ 

£  P(S{y)  =  j)P(M  >  j) 

y-O 


Again,  we  can  show  that  the  expression 


£  P(S(y)  =  j)P{M  >  j  +  k) 

y-O _ 

£  P{S(y)  =  j)P(M  >  j) 
y-O 

is  the  survival  function  for  some  discrete  distribution. 

Now  if,  as  before,  we  let  M*  be  a  discrete  random  variable  which  is  independent  of  the  Poisson 
process  [Sy(t):  t  ^  0)  and  has  this  survival  function,  then  F,,(iy)  is  of  the  form 

Fyiu)  =  £  PiSyiu)  =  k)P{M*  >  k). 

To  complete  the  proof  it  will  suffice  to  show  that  M*  is  discrete  NBUE  since  M  is  discrete  DMRL. 
That  is,  it  will  suffice  to  show  that 


(2.8) 

This  can  be  simplified  to 


\P{M*  >  /)!''  £  P(M*  >  K)  <  £  P(M*  >  k). 

k-l  Ik-0 


(2.9) 


£  PiSiy)  =  j)  £  P(M  >  j  +  k)  £  £  PiSiy)  =  j)P{M  >  j  +  k) 


y-O 


£  P(S(y)  =  j)P(M  >  j  -(-  n 


j-O  k-O 


£  P(Sii’)  =  j)P(M  >  ,/) 

y-O 


Now  after  some  algebra  we  may  rewrite  the  condition  from  (2,9)  as 


£  £  lP(S(y)  =  j)P(S(y)  =  \)P(M  >  j  -I-  l)P(M  >  A) 
y^o  aio  ' 

FiM-i  j ;  /)  .1  ‘ -  tutVt)  .1  ‘ ' 


^  0. 
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Since  for  y  >  0  or  A  <  0,  /’(5(>')  =  j)PiS O')  =  A)  =  0,  we  may  rewrite  the  preceding  as 


£  [P(S(y)  =  j)P(S(y)  ^  ^)P(M  >  J  +  l)P(M  >  a) 

yv— OO  — OO 

Tos^ttt)  i -  TtsTAy  i  >  ‘'1' « » 


By  breaking  the  summation  into  three  parts  and  using  methods  analogous  to  the  IFR  case  we  can 
justify  this  inequality  and  thus  complete  the  proof. 


3.  A  BIVARIATE  SHOCK  MODEL 


Now  consider  two  devices  which  are  subject  to  shocks.  We  will  denote  the  number  of  shocks  to 
devices  1  and  2  in  [0,  r]  by  S^it)  and  S2(t),  respectively.  Further,  we  let  (M\,  M2)  denote  the  random 
number  of  shocks  until  failure  of  devices  1  and  2,  respectively.  Block,  Paulson,  and  Kohberger  [2] 
have  unified  some  of  the  various  bivariate  exponential  distributions  in  the  reliability  literature  by  taking 
(A/),  M2)  to  be  a  bivariate  geometric  distribution  with  joint  survival  function  given  by 


P(Mi  >  m\,  M2  >  nt2)  = 


Pu'(Po.  if",,  ^  m2 

Pu^(p,o  +  Pn)”'''"^  if"i2  <  rn, 


where  Poo  +  Pot  +  f  10  +  P\\  =  K  Pio  +  Pit  <  and  po,  +  p,,  <  1,  If  we  take  S\ir)  =  $2(1)  to  be  the 
same  Poisson  process,  then  (f,,  T2)  will  have  a  Marshall  and  Olkin  bivariate  exponential  distribution. 
If  S|(/)  and  S2(t)  are  taken  to  be  independent  Poisson  processes  the  resulting  joint  distribution  of 
(T\.  T2)  turns  out  to  be  of  the  type  which  has  been  discussed  by  Downton  [4],  Hawkes  [6],  and  Paul¬ 
son  (8)  depending  on  the  choices  of  the  various  parameters.  Consequently,  in  this  section,  the  model 
in  which  the  shocks  occur  simultaneously  to  the  two  devices,  i.e.,  S,(/)  =  Sjlr),  will  be  called  the 
Marshall-Olkin  type  of  shock  model,  while  the  model  in  which  the  shocks  occur  according  to  indepen¬ 
dent  Poisson  processes  will  be  called  the  Downton-Hawkes-Paulson  type  of  shock  model.  We  will  now 
consider  various  bivariate  discrete  NBU  conditions  on  (A/,,  M2)  which  yield  bivariate  NBU  conditions 
of  the  type  considered  by  Buchanan  and  Singpurwalla  I3). 

Marshall-Olkin  Shock  Models 

Consider  two  devices  subject  to  simultaneous  shocks  occurring  according  to  a  Poisson  process. 
Letting  M^  and  M2  represent  the  number  of  shocks  until  failure  of  devices  1  and  2,  respectively,  we 
will  prove  some  results  about  //(r,,  12)  =  P(T\  >  T2  >  ti)  where  F,  and  T2  are  the  respective  life¬ 

times  of  the  devices. 

PROPOSITION  3;  If  P(M^  >  m,  +  j,  A/j  >  "I2  +  j)  ^  >  JPu  ^^2  >  W2)P(A/,  >  y, 

M2  >  J)  for  all  nonnegative  integers  m,,  m2,  y,  then  //(r,  -F  x,  ^2  -f  jc)  <  //(r,.  t2)H{x,  x)  for  all 

ti.  X  ^  0. 


PROOF.  We  will  prove  this  for  r,  ^  ,2  The  proof  for  the  case  where  r,  ^  (2  is  analogous. 
Throughout  the  proof  we  will  make  crucial  use  of  the  stationary  independent  increments  of  the  Poisson 
process.  We  begin  by  writing  for  r,  <  t2 

W(/|.  /2)  =  i;  L  P(5(r,)  =  k\.  SU2)  =  k2)P(M^  >  k|,  M2  >  k2). 

^  ,"0  *2"*  J 
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By  using  the  stationary  independent  increments  property,  we  may  rewrite  this  as 

(3.1)  I2)  =  £  P{SU,)  =  A:)  £  P(S(i2  -  /,)  =  i)P(M^  >  k.  M2>  k  +  i). 

j^o  1-0 

In  an  analogous  fashion  we  may  then  also  write  for  fixed,  but  arbitrary  x  >  0, 

7/(f,  +  X,  /2  +  x)  =  £  X  Z  {P^SitO  =  ^)P(S(x)  =  k-  ^)P(S{l2  -  ti)  =  /) 
k-O  A-0  /-O 

P(A/i  >  kM2  >  k  +  /)1 

by  using  the  stationary  independent  increments  property.  Next  by  interchanging  the  orders  of  summa¬ 
tion  and  performing  some  algebra  it  can  be  shown  that 


+  X,  l2  +  x)  =  ^ 


A-O 


PiSitf)  =  A)  £  P(S{t2  -  ti)  =  i)P(Mi  >  A,  A/2  >  A  -I-  /) 


~  P(M^  >  A  -E  y,  A/2  >  A  -E  y  +  /) 


M2  >  A  -E  /) 


By  using  the  NBU  assumption  on  (A/i,  A/2),  we  have 


H(t\  -E  X,  r2  +  ^ 


£  P(SUy)  =  A)  £  P(S(?2  -  fi)  =  i)P(Mi  >  A,  A/2  >  A  -E  /) 


A=0 


1-0 


£  PiSix)  =  y)F(A/,  >  j.  M2  >  j) 

J-O 


Rewriting  the  right-hand  side  of  the  above  inequality  we  have 
(3.2)  H (rj  +  X,  /2  +  ^  ^ 

We  remark  in  passing  that  if  the  direction  of  the  inequality  in  the  hypothesis  of  the  preceding  proposi¬ 
tion  is  reversed,  then  the  inequality  in  the  conclusion  is  reversed.  This  would  be  a  model  of  stochastic 
improvement  with  age.  We  also  note  that  the  boundary  distributions  (ones  in  which  equality  holds  in 
equation  (3.1))  in  this  case  are  the  distributions  with  the  bivariate  loss  of  memory  property.  If 
exponential  marginals  were  required,  the  boundary  would  be  the  Marshall-Olkin  bivariate  exponential 
distribution. 


PROPOSITION  4:  If  P(A/,  >  m,  -E  y,.  A/2  >  m2  -E  j2)  ^  /’(A/,  >  m,.  A/2  >  m2)P(Mx  >  yV 
M2  >  72)  fo'"  3*1  n^nnegative  integers  /W|,  m2,  j\,  j2  with  (mj  -  m2)  0)  —  Ji)  ^  0^  then 
HUx  +  xx,  t2  +  X2)  <  //(f|,f2)//(x|,  X2)  for  all  /|.  I2.  x,,  X2  >  0  with  (lx  -  h)  (->fi  “  ^2)  >  0. 

PROOF:  We  first  assume  that  /(  ^  12  and  xj  ^  X2.  As  we  have  seen  in  equation  (3.1)  of  the 
proof  of  Proposition  3 

77(,,.  ,2)  =  £  P(Siix)  =  Ac)  £  P(S(t2  -  lx)  =  i)P(Mx  >  k.  M2>  k  +  i). 

k~o  1-0 

Now  we  may  write  H (/[  -E  xi,  I2  +  X2)  as 

H(tx  +  Xx,  I2  +  ■X2^  ~  Z  Z  ■*■  -^1*  ^  k)P(Sil2  —  I|  +  X2  —  X|)  =  /■) 

k-O  1-0 

P(Mx  >  k,  M2  >  k  +  /)). 

Now  by  using  the  stationary  independent  increments  property,  interchanging  the  order  of  summation, 
and  performing  some  algebra,  we  can  show  that 
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//{/,  +  A-,,  +  -^2)  =  X  =  ^)  E  -  >0  =  rt)P(M^  >  A,  A/2  >  A  +  n) 

A=0  (  n-O 

T  P(S(ai)  =  y)  T  /»(S(a2- X,)  =  m) 

>-0  m-O 

P(A/|  >  ^  +  J,  M2  >^+j  +  n  +  m)  I 

j' 

Using  the  condition  on  (A/),  M2)  given  in  the  hypothesis  of  this  proposition  we  then  have  that 

//(f,  +  X|,  /2  +  xj)  ^  E  =  A)  E  “  '1^  =  n)P(Mi  >  A,  A/2  >  A  +  «)  ■ 

A=0  n-0 

£  P(S(x,)  =  j)  £  P(S(X2  -  X|)  =  m)P(A/|  >  j.  M2>  j  +  m)  . 

j~Q  m=0 

But  the  right-hand  side  of  the  preceding  inequality  is  just  H(i\.  t2)H(x\,  X2).  Thus,  we  have  for 
0  <  <  /2  and  0  <  X|  <  X2  that 


(3.3)  //(r,  -F  X|,  t2  +  X2)  <  //(/|.  t2)H(xi.  X2). 

For  ti  ^  ^2  ^  0  and  X|  ^  X2  ^  0,  an  analogous  argument  will  lead  to  equation  (3.3).  The  boundary 
distribution  (in  which  equality  holds  in  equation  (3.3))  turns  out  to  be  the  Marshall-Olkin  bivariate 
exponential  distribution.  Also  a  reversal  of  the  inequality  in  the  hypothesis  of  this  proposition  leads  to 
a  reversal  of  the  inequality  in  the  conclusion.  This  is  a  model  of  stochastic  improvement  with  age. 

Marshall  and  Shaked  (71  have  considered  a  cumulative  damage  model  in  which  the  damages 
(D^|  D2i)  are  independent  and  identically  distributed  random  vectors  with  a  joint  distribution  function 
G.  Then  if  d\  and  d2  are  the  fixed  failure  thresholds  of  the  devices,  then  the  survival  function  of 
(A/|,  M2)  has  the  form 

m| 

P(M\  >  m\.  M2  >  ffl2  —  P  ^  ^  E  ■®2i  ^  ^2 

.1=1  1=1 

Marshall  and  Shaked  prove  that  when  the  survival  function  of  (A/),  M2)  is  of  this  form  that  the 
hypothesis  of  the  preceding  proposition  is  satisfied.  Consequently,  the  joint  survival  function  of  the 
devices  satisfies  equation  (3.3)  for  (r,  —  12)  (x|  —  X2)  ^  0. 

PROPOSITION  5:  If  P(Mi  >  m  +  J,  M2^>  m  +  J)  ^  P(Mj^  >  m._M2  >  m)  P(M^  >  J. 
M2  >  j)  for  all  nonnegative  integers  m  and  y,  then  //(/  x.  t  +  x)  ^  H (i,  t)H(x,  x)  for  all  r.  x  ^  0. 

PROOF:  By  the  stationary  independent  increments  property  we  can  write 

H(i  +  X.  I  +  x)  =  £  ^  P(SU)  =  k)P(S(x)  =  i)P(Af|  >  k  +  J.  M2>  k  +  J). 

k-O  y-O 

Using  the  hypothesis  of  the  proposition,  we  have 

H(i  +  X.  I  +  x)  ^  £  £  P(S(t)  =  k)P(.S(x)  =  j)P{Mi  >  k.  M2>  k)P{M\  >  J,  M2  >  j). 

k-O  j-0 

The  right-hand  side  of  the  preceding  inequality  can  be  rewritten  to  yield 
H (1  +  X,  I  +  x)  K  H (i,  t)H (x,  x). 
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Finally,  we  will  conclude  this  section  with  a  counterexample.  The  question  is  whether  a  discrete 
NBU  condition 


P(M\  >  W)  +  j\,  M2  >  W2  +  ^  P(M]  >  m\,  M2  >  m2)P(M\  >  j\,  M2  >  72) 


for  all  nonnegative  integers  mi,  m2.  7i.  72  would  imply  the  continuous  analog 

H(!\  +  xi.  t2  +  X2)  ^  //(ti,  t2)H(x\,  X2)  for  all  ri,  I2,  Xi,  X2  ^  0. 


The  following  counterexample  answers  this  question  in  the  negative.  Suppose  that  M^  and  M2  are 
degenerate  random  variables  with  M\  —  1  with  probability  1  and  M2  —  2  with  probability  1.  It  can  be 
shown  that  P(M\  >  m\  +  j'l,  M2  >  m2  +72)  ^  P(M\  >  m\,  M2  >  m2)P(M\  >  j\,  M2  >  72^ 
that  if  /]  <  /2,  xi  >  X2  and  ft  +  X|  >  /2  +  ^2. 


H(t\  +  x,.  /2  +  X2)  -  H(t\,  t2)H(xi,  xi)  =  e  Y. 

k-2 


(\(/2-  tl))* 
k\ 


>  0. 


However,  an  NBU  result  along  these  lines  does  hold  in  the  case  of  the  Downton-Hawkes-Paulson 
Shock  Model. 


A  Downton-Hawkes-Paulson  Shock  Model 


In  this  model  we  consider  two  devices  subject  to  shocks  occurring  according  to  independent  Pois¬ 
son  processes.  Let  !(Si(r):  r  >  0l  and  |S2(r);  r  >  0)  be  these  processes  and  (M\.  M2)  be  the  random 
number  of  shocks  until  failure  of  the  respective  devices. 


PROPOSITION  6:  If  P(A/,  >  m,  -E7,,  M2  >  mj  +  ,/2)  <  >  mj^,  M2  >  m2)P(M\  > 

il,M2>_  J2)  for  all  nonnegative  integers  m\,  m2.  j\,  then  H(t\  +  X\,  12-^  X2) 
t2)H{x\,  X2)  for  all  fi,  t2,  X|,  X2  ^  0. 


PROOF:  By  the  stationary  independent  increments  property  f  1  an  interchange  of  the  order  of 
summation  we  have 


H {t\  -E  X],  t2  +  X2)  =  I  I  II 


“0  j^n 


/’(5|(x|)  =  m,  52(.V2)  =  n)P(M\  >  m.  A/2  >  n) 


PlSiIfi)  =  k  -  m,  82(12)  =  7  -  n) 


P(M^  >  k,  M2  >  J) 
PiM]  >  m,  M2  >  n) 


Using  the  hypothesis  of  the  proposition,  we  can  write 

//(f,  +  xi.  t2  +  X2)  <  X  I  =  "»■  ^i^xi)  =  n)P(M^  >  m.  M2>  m) 

m-0  n^O 

^  /*(5i(r|)  =  k  -  m,  82(12)  =  J  ~  '>) 

Jk-m  j^n 

P(M\  >  k  —  m,  M2  >  j  —  n)]. 


By  a  change  of  variables 


H (fj  +  X|,  22  +  X2)  < 


2^  PWiIx,)  =  m,  S2(.T2)  =  n)P(Mi  >  m.  M2  >  n) 

m— 0  o-O 


2^  £  /’(s,(f,)  =  k.  82(12)  =  J)P(M\  >  !<■  ^2  >  ./U- 

k~0  J^O  ) 

But  the  right-hand  side  of  the  preceding  inequality  may  be  rewritten  as  //(ti,  i2)H(xi.  X2),  thus  com¬ 
pleting  the  proof. 
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ABSTRACT 

Take  n  independent  identically  distributed  (HD)  observations  from  a  con* 
tinuous  r-variaie  population,  and  choose  some  order  statistics  from  each  of  (he 
r  variates  These  order  statistics  are  used  to  construct  a  grid  in  r-dimensional 
space.  Under  certain  conditions,  it  is  shown  that  as  n  increases  we  can  choose 
an  increasing  number  of  order  statistics  in  such  a  way  that  the  asymptotic  joint 
distribution  of  the  chosen  order  statistics  and  of  the  frequencies  of  sample 
points  falling  in  the  cells  of  the  grid  can  be  assumed  to  be  a  normal  distribu¬ 
tion  An  application  to  testing  independence  of  random  variables  is  given 


1.  NOTATION,  ASSUMPTIONS,  AND  STATEMENT  OF  THEOREM 


We  observe  n  IlD  r-tuplets  {A’i(/) . X,(i)).  ;  =  1 . n,  each  with  joint  cumulative  distri¬ 
bution  function  (cdf)  F„(x\ . x,),  joint  probability  density  function  (pdf)  /„(xi . x,).  We 

choose  a  set  of  order  statistics  from  (.Vid),  ....  Xi(n)),  a  set  of  order  statistics  from 

(^"2(1) . X^in)) . and  a  set  of  order  statistics  from  (A’^d),  ,  X,(n)).  These  chosen  order 

statistics  are  used  to  construct  a  grid  in  r-dimensional  space.  This  grid  defines  a  system  of  r- 
dimensional  cells,  each  cell  having  a  frequency  of  sample  points  falling  in  its  interior.  It  is  clear  from 
the  argument  of  HI  that  if  the  number  of  chosen  order  statistics  for  each  variate  is  fixed,  then  under 
mild  conditions  the  joint  asymptotic  distribution  of  the  chosen  order  statistics  and  the  cell  frequencies 
(all  properly  standardized)  is  normal.  But  for  certain  purposes,  it  is  necessary  to  let  the  number  of 
order  statistics  chosen  increase  as  n  increases.  In  this  paper  we  show  that  under  certain  conditions  we 
still  get  a  joint  asymptotic  normal  distribution. 


For  typographical  simplicity,  until  Section  3,  we  discuss  only  the  case  r  =  2,  and  we  write  the  n 

IID  pairs  observed  as  iXl,  Yl),  {X2.  Fp . iX^,.  Y'„).  The  joint  cdf  and  pdf  for  (Afi'.  Y{)  will  be 

written  as  F„(x,y),  /„(x.  y),  respectively,  and  so  are  allowed  to  depend  on  n.  G„(x),  ^„(x)  denote, 
respectively,  the  marginal  cdf  and  pdf  for  X{.  W„(x),  fj„(x)  denote,  respectively,  the  marginal  cdf  and 


pdf  for  Yf'.  Define  '(x*. 


>■*)  as  —  Fjx. 


and 


f  ”’(x*,  y*)  as  F„(x.  y)l.,_,.,.,. 
dy 


For  each  n,  choose  p„,  q„,  and 
nq„  —  np„ 

K„  =  - - -  are  all  positive  integers. 

Cft 

specified  below. 


L„  so  that  0  <  p„  <  ()i„  <  1,  and  np„,  nq„,  L„,  and 
These  quantities  also  are  to  satisfy  other  conditions  to  be 


A'l',)  <  ...  <  A'i',1  denote  the  ordered  values  of  (Afi' . X'„)>  and  F|||  <  ...  <  F('„i  denote  the 

ordered  values  of  (  Ff .  Y'„).  X*  denotes  X(„p^^  u  ~  \)l„]  and  F*  denotes  F|'„p  +  for 
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1=1 . /C„  +  1.  Let  ND(n)  (for  "No  duplication  in  n  observations")  denote  the  following  event: 

there  is  no  pair  {XI,  L,')  with  XI  among  (X* . among  (  K* .  assume 

that  (/„,  p„,  q„,  L„]  are  such  that  lim  P[ND{n)]  =  1.  This  assumption  will  be  discussed  in  Section  3: 

n— oo 

under  it,  for  all  asymptotic  probability  calculations,  we  can  (and  will)  assume  that  ND{n)  occurs  for  all 
n. 


Define  TJ  as  supjx:  G„{x)  »  0),  X\^^.2  as  infljc:  G„{x)  =  l),  Yq  as  suply':  H„{y)  =  0),  and  Y*k^+2 
as  inf(>’:  H„(y)  =  I}.  Thus,  these  values  may  depend  on  n,  and  may  be  — “  or  oo. 


Define  as  the  number  of  pairs  {XI,  Yl)  in  the  open  rectangle  in  (x,  >)-space  defined  by 
{Xl-i  <  X  <  X*  Y*-i  <  y  <  Yp,  for  /,  y  ranging  between  1  and  +  2,  inclusive.  Define  Sy  as  1  if 
the  Y'  originally  associated  with  T*  falls  in  the  open  interval  ()^*_|,  Y*)  and  as  zero  otherwise,  for 

/  =  1 . +  1  and  y  =  I . K„  +  2.  Define  Tjj  as  1  if  the  X'  originally  associated  with  Y*  falls 

in  the  open  interval  (A’*.!,  Xp  and  as  zero  otherwise,  for  y  =  1 . +  \  and  /  =  1 . K„  +  2. 

The  occurrence  of  ND{n)  then  implies  all  the  following  equalities: 

^  =  1  for  /  -  1 . +  1, 

/-I 

£  t;,  =  1  for  y  -  1 . a:„  +  1, 

/-I 

L  ^1/  +  X  ^/i  -  ■' 

/-I  i-\ 

K„+l 

(l.I)  Z  ^>1  Z  h  =  f-n  -  1  for  /  =  2 . K„  + 

/-l  /-! 

a:,+2  a:„+i 

Z  +  Z 

/-I  y-l 

A„+2  A„+l 

Z  ^'1  Z  •^<1  =  ~  1- 

f-l  /-I 


Z 


(V, 


+  z 


Sij=  L„-  I  for  y  =  2. 


K„  +  1, 


X,+2  A,+l 

Z  +  2  +  Z  •^■.A,+2  -  n  ' 

/-I  i-l 


Let  G{i  —  I,  n)  denote  G.' 


np„  +  (i  -  1)L„ 


and  C(Al„  +  1.  n)  as  X%  +2  Let  H{J  —  \  ,  n)  denote  H„  ' 


for  /■  =  1 . AT,  +  1 ,  define  C (—  1 .  n)  as  A’J, 

I 

np„  +  (y  -  \)L„ 


fory  -  1 . K„  +  I, 


and  define  H{—\,  n)  as  KJ,  //(A(„  +  1,  n)  as  La,+2  Let  g{i  -  1,  n)  denote  g„{G{i  —  1,  «))  for 

t  -  1 . +  1,  and  h{j  -  1,  n)  denote  h„{H{j  -  \ ,  n))  for  j  \ . +  1.  Note  that  these 

definiUons  imply  _  that  f„‘'’(C(i  -  1.  n),  H{K„  +  1.  n))  -  g{i  -  1,  b),  and 

F:I^HG{K„  +  1.  b),  H{J  -  1,  B))  -  h{j  -  1.  b). 
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Next,  define  X,  as  ^/n  g(i  -  L  «)  (X*-  G(i  -  L  n))  for  /  =  1 . A:„  +  1,  and  define  Xq  as  0 

and  as  0.  Define  r,  y/n  h(J  -  I .  n)  (  XJ  -  H(J  ~  1 .  n))  for  J  =  ] . +  I,  and  define 

Yq  as  0  and  as  0. 

For  /  =  1 . X„  +  2  and  7=1 . +  2,  define  Q,,  as 

F”'(G(i-  1.  «).  f/(J~  1.  n))X/g(i-  1.  n) 

+  F”*  (G(/  -  1,  n).  H(j  -  1.  n))  Y,/h{j  -  I.  n) 

-  F„"'(C(/ -  2.  «),  H(j-  \  .  n))  X,-y/g(i  -  2.  n) 

-  (GD  -  2,  n).  n))Y,lh(j  -\.n) 

-  F„"’{C(/  -  1.  n).  Hij  -  2.  n))Xjg(i  -\.n) 

-  F„<^’(G(/  -  1.  //).  HiJ  -  2.  «))  Y,.Jh{J  -  2.  n) 

+  F„'"(G(/  -  2.  n).  Hij  -  2.  n))X,.\lg(i  -  2.  n) 

+  F^^HGd  -  2,  «),  W(7  -  2.  n))  F,_,/AO'  -  2,  n) 


where  any  term  containing  Xq,  Xf,  +2,  Kq,  or  X^,  +2  is  defined  to  be  zero.  It  follows  from  the  definitions 

K  +2  **  )3 

that  2  a,  =  A-,  -  so  £  X  g,,  =  Ar„  for  a  =  1 . +  2,  Similarly,  £  £  for 

/-I  1-1  y-l  >-l  '”1 

^  =  1 . K„  +  2. 


Define  p„(/,  y^as  the  probability _assigned  by  F„(x  >>)  to  the  rectangle  (G(/-2.n)< 
X  <  C((  -  1.  n),  Hij  -  2,  rt)  <  >  <  /ffy  -  1.  rt))  for  /  =  1 . A:„  +  2,  y  =*  1 ,  ,  /f„  +  2. 

Nii-np„(i.J)  - Qo -  . K„  +  2 


Define  W,,  as 


1. 


y/npjTj) 

K„  +  2.  Then  we  have 


and  Z,, 


as 


w  — 


1, 


and 


(1.2) 


2(K„  +  1) 


£  £  y/Pnii'  ^ 

a 

„  ^,.2  -I  I 

£  £  y/JJiT)z,=  -  - 


for  a  =  1 . Af.  +  1 , 


-£  £  s,-p 

i-\  1- 


£  £  yfM)  Zi!  =  '  -  r„ - 

,-l  V" 


for/3  =  1 . K„^\. 


Let  denote  min  (j?„(x):  G-'(p„)  <  x  <  G„''(?„)|,  h*  denote  min|y)„(>');  H„  Hp„)  < 
b„  denote  minUj,  h*„\,  and  c„  denote  minlX,(x,  y);  G„ '(p„)  ^  x<  G“'(<7„), 
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We  assume  the  following: 


(1.3) 


fnix.y).  Fli^Hx.y).  F„^^Hx,y). 


i  0  , 


and 


-^fnix.y) 


are  all  bounded,  for  all  values  of  x,y,  and  n. 


(1.4) 

(1.5) 

(1.6) 

(1.7) 


min  p„{ij)  =  0 

There  exists  a  finite  value  b  so  that  b„  <  b  for  all  sufficiently  large  n. 

K'^ 

— j-j=r  approaches  zero  as  n  increases. 
b„yJn 

f<^n 

- ■y=-  approaches  zero  as  n  increases. 

c„b„-Jn 


1 


max  p„U.j)  =  0 

U.j) 


1 


Denote  by  /„  the  joint  pdf  for  the  random  variables  ISjy.  7},,  Xj.  T,,  M',,). 

Now  we  construct  an  "artificial"  joint  pdf  for  the  random  variables  {S,y,  Tji,  AT,,  Yj.  as  fol¬ 
lows.  The  2{K„  +  1)  -h  1  sets  of  random  variables  {S/i . ^ . 

(/■/i . 7'a,J  for  7-1 . fC„  +  1;  and  {2-  /  =  1 . k„  +  2;  7=1 . K„  +  2; 

iij)  ^  (K„  +  2,  K„  +  2)]  are  independent  of  each  other.  The  joint  distribution  of  |S,i . •S,  a  +2I 

K„*2 

is  given  as  follows:  each  5„  is  0  or  1,  J  S,,  =  1,  and 


PiS^,  =  1)  = 


(G(/-l./»).//0-l,n))  -  F„(l)(G(/-l.n).//0-2,n))) 
gO  -  \  .  n) 


The  joint  distribution  of  {T}) . +2I  is  given  as  follows:  each  7^  is  0  or  1,  ^  7",/  =  1,  and 

1"  1 

[Fi^HG(i-l.n).HU-Yn))  -  Fi^\GU~2.n)  .n))] 

h{j-\,  n) 

The  (fC„  +  2)^  —  1  random  variables  (2^,:  /  =  1 . K„  F  2\  7=1 . +  2; 

(ij)  ^  (K„  -y  2.  K„  ■¥  2)1  have  the  following  joint  normal  pdf: 


+  2.  K„  +  2))  ^ 


A«  +  2  A. 4-2 


exp  J-y 


I 

/-I 


1-1 


where  2i^^+2.k  +2  is  given  by  the  identity  ^  ^  yfpiJij)  z„  ”  0.  For  future  use,  we  note  that  in  this 

i-i  (-1 

distribution,  the  variance  of  Zg  is  1  -  p„{i.J),  and  the  covariance  between  2„  and  2,  ,  (where 
U.j)  ^  U'.J'))  is  -•Jp„U.J)p„U',j')-  Now  define  (AT, . l^i .  ‘erms  of 

15„.  T,,.  2,,)  by  means  of  equations  (1.2).  Defining  as  above,  define  as  2,^  +  -  .S''.—  for 

V  PnU.J) 
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i  <  K„  +  \  and  j  <  K„  +  1.  Then  define  W',/  as  the  closest  value  to  that  makes  np„(i,j)  + 
yjnp„(i.j)  W,!  an  integer  (positive,  negative,  or  zero).  Finally,  define  ^,,k„+2  by  the  equation  ^ 

_  H„  +  2  K„  +  2  K„  +  2  _  A, +  2 

y/npJJj)  W,I  =  S,  "  Z  PndJ).  ^k„+2.i  by  the  equation  -JnpjJj)  ^  ~  ” 

1-1  /-I  1-1  i-i 

A„+2  A„+2  A„-F2  _  A„+2 

2^  p„(/,7),  and  by  the  equation  ^  L  V"Pn('.7T  =  -2(a:„  +  1),  where  ^  A'(,  and 

1-1  j-i  1-1  /-I 

*-  +  2 

^  /V,,  are  defined  by  the  equations  (1.1). 

1-1 

Denote  by  /*  the  joint  pdf  for  the  random  variables  {S^,  T",,,  X,.  Y,,  If,,}  induced  by  the  process 
just  described. 

For  any  measurable  set  C„  in  the  space  of  (S„.  Ty,.  Xj.  Yj.  iVjj],  let  Pj  (C„),  PfAC„)  denote  the 
probabilities  assigned  to  C„  by  /„  and  respectively.  In  the  next  section,  we  prove  the  following; 

THEOREM;  lim  sup  \P-f  (C„)  -  (CJ|  =  0. 

/I— <»  " 

2.  PROOF  OF  THE  THEOREM. 


The  theorem  combines  the  results  of  [1],  12],  and  [3l,  and  the  proof  is  a  combination  of  the 
proofs  in  those  papers.  First  we  list  some  notation  and  elementary  results  that  will  be  used  in  the 
proof. 


The  symbol  "log"  always  means  the  natural  logarithm.  For  any  event  E,  E  denotes  its  negation. 
<t»(jf)  denotes  the  standard  normal  cdf  If  we  state  that  a  sequence  of  random  variables  |K„)  is 

0„(r(n)),  it  means  that  converges  stochastically  to  zero  as  n  increases,  for  an  arbitrary 

r(n)li(n) 

sequence  j8(n)  with  fi(n)  approaching  °°  as  n  increases.  Thus,  if  ( V„\  is  0p(r(n))  and  r(n)  approaches 
zero  as  n  increases,  then  y„  converges  stochastically  to  zero  as  n  increases. 


The  following  equalities  and  inequalities  are  well-known; 


(2.1) 

For  any  events  £1 . E„,  P(E^  O  ...  D  E„)  >  1  - 

(Bonferroni'  s  ine- 

quality). 

1—  1 

(2.2) 

For  any  a  >  0,  1  -  <t>(x)  <  e  ^  . 

(2.3) 

For  any  positive  integer  m,  log  m\=  log  27r  -f 

.  1  ,  ,  aj(m) 

m  +  -:r  log  m  -  m  + - , 

2  m 

where  lw(m)l  <  1.  (Stirling’s  formula). 

(2.4) 

If  V  is  any  random  variable  with  P(F  <  0)  =  0,  and  if  b  is  any 
F(F) 

value,  Pi.y  <  b)  >  \ - r —  (Chebyshev’s  inequality). 

b 

nonrandom  positive 

(2.5) 

If  |x|  <  1,  log  (1  +  x)  =  X - —  + - rr  where  |f)|  <  1. 

2  3(1  -f  Ox)^ 
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LEMMA  2.1:  If  Vi,  V„  are  jointly  normally  distributed,  with  zero  means,  variances 
(Ti,  ,  cr^,  respectively,  and  any  covariances,  and  if  ?  denotes  max  (o-|,  ,  a-„),  then  for  any 

c  >  0,  Z’  [max  lA",  !  <  cl  >  1  — exp  — ^  . 

/-I . m  cy/ln  20- 

PROOF:  A  combination  of  (2.1)  and  (2.2). 

LEMMA  2.2:  Under  both  f„  and 

Ik  - 

[max  \X,\/^/ngii  -  1,  «))  =  0.  .  and  [max  lYjl/Vn  h(j  -  L  n)]  =  0^ 

/-I . A-„+l  |V/10„  y-l . x„+i 


K„ 

y/nb„ 


PROOF:  We  only  have  to  give  the  proof  for  X,,  since  the  proof  for  Yj  is  the  same.  Under  f„,  we 
can  assume  the  joint  distribution  of  (AT,)  is  the  normal  distribution  described  in  [2],  and  then  set 

c  =  in  Lemma  2.1,  completing  the  proof.  Under  using  equations  (1.2)  we  note  that 

'Jno„ 

a  2(K„  +  2)  a  /— rr-^ 

“  X  X  p - ,  and  that  the  variance  of  ^  ^  -Jp„(i.  J)Zij  is  less  than 

1-1  j-1 

one.  The  result  then  follows  directly  from  Lemma  2.1. 


/v  ^  1 

LEMMA  2.3:  Under {  max  I l^^ijl/y/np„(iJ^]  =  0  -  V  . 

i.J<K„^2  D„-sln  J 

PROOF:  Let  {/3„1  denote  a  sequence  of  nonrandom  positive  quantities  such  that  lim  /3„  =  <», 

oc 

K 

and  lim  X-  =  0.  Let  denote  the  number  of  {XI,  Yl)  falling  in  the  open  rectangle 

_  n—oo  b„y/n  _  _  _ 

{G{i  -  2,  n)  <  X  <  G{i  -  1,  n),  Hij  -  1,  n)  <  y  <  H(J  -  1,  /;));  let  AJ®’  denote  the  number  of 
{X[,  F/)  falling  in  the  open  rectangle 

i  ^-1  f  rtp„  +  (/  -2)Z,„  ^  0„K„  I  ^  ^  ^  (j-i  f  np„  -I-  (/-  1)Z.„  _  p„K„  |  \ 


np„  +  (/-2)Z,„ 

np„  +  (/-  1)Z.„ 

n 

-Jnb„ 

n 

yfnb„ 

np„  +  {J-2)L„ 

^  UnKn 

np„  +  {J  -  \  )L„ 

l3r.K„ 

n 

yfnb„ 

^  y  n„ 

n 

yfnb„ 

the  number  of  (Xl 

Y,')  falling  in  the  open  rectangle 

,  np„  +  {i-2)L„ 

.er  r  <r  CT-' 

np„  +  U-  \  )L„ 

n 

■Jnb„ 

n 

^b„ 

np„  +  {J-2)L„ 

//“I 

1 

+ 

n 

Vnb„ 

n 

{G„  '(a)  is  defined  as  ATJ  if  a  <  0,  and  as  X%^+2  if  o  >  1;  '(o)  is  defined  as  Fq  if  a  <  0,  and  as 

F]^^^2iffl>l)- 

Clearly,  iVj®*  <  iVj”  <  and  by  Lemma  2.2  with  probability  approaching  one  as  n  increases, 
we  have  Nj®’  <  iV,,  <  Now  —  )Vf®'  is  the  number  of  pairs  (A',',  F/)  which  fall  in  a  region 

_  2p  K 

consisting  of  pieces  of  four  strips,  each  strip  having  probability  (under  f„)  no  greater  than  -  Z—”  . 

■Jnb„ 

8  13  IC 

Therefore,  -  A'j®’}  <  ”^7  "  ,  and  using  (2.1)  and  (2.4),  if  is  any  positive  value, 

“  'Jnb„ 
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8/1  {K  +  2)^K  j3 

P[Nij^^  -  iVj®*  <  p„  for  all  /,  7]  ^1  —  - Below  we  will  choose  /3„  and  /3„  so  that 


finb„ 


yfnb„p„ 

approaches  zero  as  n  increases;  then  we  can  say  that  with  probability  (under  /„)  approaching 


one  as  n  increases,  we  have  I^Vy  -  AfJ'M  <  for  all  /,  j. 

Af'J’*  -  np„{i.  j) 


Now  define  as  — ■  Then  with  probability  (under  /„)  approaching  one  as  n 
y/np„U,j) 


increases. 


ij _ 


■sfnpjTj)  ylnp„U.  j) 


max 


'JnPn^i.  y)  ‘  " 


K„yl\o%K„ 


< 


~  np„{i.j) 


for  all  i,  j.  But  by  [3]  and  Lemma  2.1, 


.  Since  can  increase  arbitrarily  slowly,  we  can  let  /3„  increase  at 


\fnK^ 

any  rate  that  makes  — —  approach  zero  as  n  increases.  This  completes  the  proof. 


LEMMA  2.4:  Under /*,  {  max  I  np„(i,  y)!  =  0^ 


i.j<K„+2 


K 


b„Jn 


PROOF:  For  i  and  j  both  less  than  or  equal  to  A",  +  1,  is  within 

Qu 


2.WU) 


Zy  + 


J) 


.  Applying  Lemma  2.1  to  (Zyl  and  {Qj,  we  immediately  find  that  under  7^, 


max 


\/np„(i,  7) 


=  0„ 


b„Vn 


-ijt  yyg'ijf 

K,+2 


Since  ^  V np„{i,  j)  is  Qp(K„),  we  see  that 


f-i 


l^^l.X,.2l 

— n 

sanH 

l^^K„.2.7l 

mQ  V  — 

sO 

m3X  /  ' 

i<K„+\  yJ np„{i,K„-\-2) 

b„Vn 

dllU 

'Jnp„(.K„+2,  j) 

^p 

b„Vn 

From  the  definition  of  k„+2  under  f^,  it  follows  that 


b„Vn 


V«P„(A„  +  2.  A„T2)  ^ 


VJ, 


.  This  completes  the  proof. 


LEMMA  2.5:  At  every  point  (5y.  Tj,.  A,,  Yj,  W^j)  at  which  f„  is  positive,  we  can  write 

_  _  _  1  A„+2 

log/„(Sy.  Yj.  iFy)as^,(«)-|  £  £  Zij  +  ^^in) 


+  Z,  L  'og 

/-I  j-\ 

*■  +1  A„  +  2 

+  I  I  7;,  log 

y-1  ,-l 


7-1  .-1 

F„“>(C(/-1,  n),  n))  -  F^^HGU- L  n).  n)) 


g{i-  1,  n) 

FJ^UGO-  1,  n),  H(J-  1,  n))  -  FJ^HGO-2,  n),  H(J-  1,  n)) 


h{J-\,  n) 


where  A,(n)  is  nonrandom,  and  A|(/i)  converges  stochastically  to  zero  as  n  increases  under  both  f„ 
andy’. 
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_  PROOF^  From  the  multinomial  formula,  assuming  that  the  event  ND(n)  occurs,  the  joint  pdf  for 
(5,,  f„.  jf.  yi-  is  the  product  of  the  following  four  expressions: 


n\ 


Y1)  -  F„(xu.  yp  -  fr(^r.  y:-i)  + 

j-i  ,-i 


jV  ' 


n  n  yj)-fj''(n  f;_,)p 

,-l  y-1 

fl  fl  yp  - 

y-1  /-I 

if  jV,y,  5y,  7j,  are  nonnegative  integers  satisfying  the  equations  (1.1),  and  ^5  <  A']'  <  . , .  <  XX  and 
Fq  <  •  ■  •  <  ^A„+2;  the  Joint  pdf  is  zero,  otherwise.  From  this  it  follows  that  when  /„  is  positive,  we 
can  write  log  Tj,.  X,,  Yj,  W'y)  as  the  sum  of  the  following  six  expressions: 


(2.6) 

(2.7) 

(2.8) 


log  «! 

A„  +  2  A„  +  2  _ 

"  Z  Z  log  (np„(i,  J)  +  yJnp„U.  j) 


y-1  /-I 


z  z 

y-l  i-l 


np„U,  j) 

+  V  y)w^y 


X  log 


Gd-ln)  +  - 


X, 


V  rtg(/-l,n) 


h  (J—  1 ,  ) 


C?(/-2,n)  + 


G  (i—\,n)  + 


G(i—l,n)  + 


AT,. 


yfng(i-2.n) 

X> 

Vnf  (/— 1,«) 

■y,-i 

yfngU—l.n) 


.  H(J-\.n)  + 


,  H{j-2.n)  + 


,  H  {J—2,n)  + 


\fnh(  J—\,n) 


'y-i 


\fnh  (j-2,n) 


'.1-1 


yfnh(j~2.n) 


A„+l  A„-t-2 

(2.9)  115,  log 

1-1  y-1 


A,+l  A„+2 

^2.10)  £  2:  ^2vlog 

y-1  /-I 


r(l) 


G  ( /—  1 .  W  )  -h 


_^<1) 


C(/— l.n)  + 


yfn  gU-\.n) 
AT, 


Fi^MC(/-l,n)  + 


-F, 


(2) 


G(i—2,n)  + 


-Jng{i—\,n) 

X, 

'/n  g(i-\,n) 

X.-, 


.  H(J-ln)  + 


.  H(j~2.n)  + 


\fnh{J~yn) 


yfn  h  (J—2,n) 


^/ng{i—2.n) 


H(j-\.n)  + 


,  H{J-\.n)  + 


Vn  h  (,/■—  1 , 21 ) 
Y, 


yTn  h(J-l.n) 
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(2.11) 


AC„+I  K^  +  \ 

—  ^  log  (V«  g(/— l,w))  —  ^  \og  (y/nh(J~^<n)). 

,-l  y-l 


log 


Now  we  carry  out  the  following  sequence  of  calculations. 

Apply  (2.3)  to  each  term  in  (2.7),  and  write  log  (np„(i,j)  +  'J np„{i,j)  Wij)  as  log  np„(i,j)  + 


I  + 


W., 


y/np„{i,  J) 


,  then  apply  equation  (2.5)  to  log 


1  + 


y/np„U  j) 


for  all  i,  J. 


Expand  F„ 


Gii-l.n)  + 


treating 


■JngU—\,rt) 


,  H(J-\,n)  + 


and 


yj 


around  ((?(/— l,n). 


Vrt  h  (j—\,n) 
as  infinitesimals,  and  stopping  with  terms  qua- 


J~n  g(i-\.n)  yfn  h(j—\,n) 
dratic  in  these  infinitesimals,  for  /  =  1 . +  1  and  7=1,  ....  K„+  \.  This  allows  us  to  write 

the  log  in  (2.8)  as  log  [p„(i,  j)  +  +  R„{i.  y)],  where  max  |/?„(/,  7)  I  =  0 

V  n  ij 


A  A  +2 


nb‘ 


A  +2 


.  We  note  that  ^ 


Qo 


i-i 


^  R„(i.j  )  =  0  and  ^  Q,j  =  ^  log  [p„(;,  7)  +  “7^+  7')]  as  log  p„(i,  j) 

/-I  /-I  /-I 


log 


1  + 


RnUj) 


sTrtp^ii.j)  P„(i.j) 
G(i-l.n)  + 


and  apply  equation  (2.5)  to  this  last  logarithm,  for  all  7. 


Expand 

treating 


A", 


yfn  g(i—l,n) 


,  //(7-l,«)  + 


'/n  h(j-\,n) 


around  (C7(/— 1,«), 


as  infinitesimals,  and  stopping  with  terms  linear  in 


g(i—l.n)  -/n  h(j—l,n) 

these  infinitesimals,  for  /  =  1 . +  1  and  7  =  1,  +  1.  Then  treat  the  resulting  logs  in 

(2.9)  a:,  the  logs  in  (2.8)  were  treated. 

For  (2.10),  expand  instead  of  Fj",  and  then  proceed  similarly.  After  carrying  out  the  steps 
indicated,  the  proof  of  Lemma  2.5  follows  directly  from  Lemmas  2.2,  2.3,  and  2.4,  and  assumptions 
(1.3)-(1.7). 

LEMMA  2.6^  every  point  (5^.  Ty,.  X,,  Yj.  It'y)  at  which  f*„  is  positive,  we  can  write 
\ogn(S„.T„.X,.Y,.W,^)2^s 


1 


A. +2  A  +2 


-  T  L  Z  zj  +  Ayln) 

^  i- 1  1 


+  L  Z  •S'-y  log 

i-l  ;-l 
A.+l  A. +2 

+  I  I  7-,  log 

/-I  /-I 


f„"’(G(/-l,n),  H(j--\.n))  -  (C(/-l .n) .  H(j-2.n)) 


g(i-\,n) 

Fj^>(G(i-l.n).  W(7-l,n))  -  F„'^>  (G(i-2.n) ,  //(j-I.n)) 


hU-l.n) 


where  ,4j(n)  is  nonrandom,  and  A 2(22)  converges  stochastically  to  zero  as  n  increases  under  both  /„ 
and 
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PROOF:  From  the  construction  of  f*,  we  have  that  if  Tj,,  X,,  Yj.  is  positive,  it  is  the 

product  of  four  expressions: 

(2  12)  Tt'  *n  ^0-1.”))  -  Fjj'HGii-l.n),  H(j-2.n)) 

3  gU-l.n) 

(2  13)  Yr'  *n  ^0-1-”))  -  Fi^>(G{i-2.n).  HU-\.n)) 

Ji  h(j-Ln) 


F„^'HG(i-\.n).  H(J-\.n))  -  Fj”  (C(/-l.«),  Hij-2,n)) 
g(i—l,n) 

Fj^>(G(i-l.n).  N(j-Ln))  -  (G(i-2.n).  ^Q-l.w)) 

A(/-l.n) 


(2.14) 

(2.15) 


/  -  /exp 


(p„(X„  +  2.  X„  +  2))  ^ 

dip,,  ■  ■  ■  yF. 


AC„+2  A„+2 

-III  zj 

^  y-l  .-I 


where  y(n)  is  the  absolute  value  of  th^determinant  of_the  transformation  used  in  going  from  lZ,yl  to 
(Af,,  Yj.  W^j]  for  given  values  of  (Sy.  7),),  and  R„  is  the  region  given  by  the  inequalities 


W,, 


1 


2y/npJJj) 


^0  < 


W,j  + 


2y/np„(U) 


:  ij  <K„  +  \ 


Since  the  transformation  used  in 


going  from  (Zyl  to  [X,,  Yj,  fFy)  is  linear,  Jin)  does  not  depend  on  any  of  the  variables.  The  proof  of 
Lemma  2.6  is  completed  by  treating  (2.15)  as  the  analogous  expression  was  treated  on  pages  145  and 
146  of  [3]. 


LEMMA  2.7:  Let  R*  denote  the  region  in  (5y.  7}/.  Xj.  Yj,  fFy)*space  where  /J  is  positive  and  f„ 
is  zero.  Then  lim  PfiR*)  =  0. 

n-«  "  " 

PROOF:  R*  consists_of  those  points  where  at  least  one  of  the  quantities  np„ii,j)  +  yjnp„ii.j)  1F„ 
is  negative.  Thus,  Pf.^  iR')  =  Prinp„iU)  +  y/np„iij)  )F„  >  0,  all  i.J).  Using  Lemma  2.4  and 
assumption  1.6,  it  follows  that  P/-.  (/?J)  approaches  one  as  n  increases,  proving  Lemma  2.7. 


Now  define  the  random  variable  by  the  equation 


log 


f„.  X,.  Yj.  W,j) 
niS,j.  %.  X,.  Yj.  W>j) 


=  Axin)  -  A^in)  +  F„. 


It  follows  from  Lemmas  2.5,  2.6,  and  2.7  that  P„  converges  stochastically  to  zero  as  n  increases,  under 
both  /„  and  /*.  This  implies  that  there  are  sequences  {e„),  l6„)  of  positive  nonrandom  quantities,  with 
lim  «„  =  0  and  lim  8„  =  0,  such  that  the  region  Rin)  where  I  F„|  <  e„  has  probability  at  least  1  -  S„ 

n—oo  _ 

under  both  /„  and  /*.  Then  we  have 


(2.16) 

(2.17) 

(2.18) 


1  -8„  < 


-  s ' 

r  7  -  'tiin)-A2in)  T  ^ 

(1 -8.)^“"  </,,„, 


and  from  these  three  inequalities, 

(2.19)  (1  -  8„)  <  e‘» 

(2.20)  e'''''"~'''''”(l  -8„)e'‘"  <  1. 

It  follows  from  these  two  last  inequalities  that  A^in)  —  Aiin)  converges  to  zero  as  n  increases.  By  the 
reasoning  used  in  [2]  or  [31,  this  completes  the  proof  of  the  Theorem. 
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3.  FURTHER  DISCUSSION  OF  THE  THEOREM 

The  theorem  can  be  generalized  in  various  ways.  The  subscripts  of  the  chosen  order  statistics 
+u-\)L  1  and  ^v„+0-nL„i  “n  be  replaced  by  subscripts  which  are  not  evenly  spaced,  and  different 
spacings  can  be  used  for  the  X ’s  and  the  Y ’s.  Also,  assumption  (1.4)  is  convenient,  but  not  essential: 
it  was  not  made  in  [3]. 

The  Theorem  was  proved  for  the  bivariate  case  for  convenience.  It  is  easy  to  see  how  it  can  be 
proved  for  cases  of  higher  dimensions.  For  example,  suppose  we  observe  n  IID  triplets 

(AT,',  y;.  Z{) .  iX’„.  Y'„.  Z‘„).  We  define  X*.  Y*.  Z‘  for  /  =  1 . A:„  +  1,  as  above.  N,,^  is 

defined  as  the  number  of  triplets  (X].  K,'.  Z,')  in  the  open  cube  in  (x.  y,  z)-space  defined  by 
<  X  <  X*,  T*-i  <  y  <  Y'j,  ZJ_,  <  2  <  ZJ).  The  indicator  variables  {S,y),  [T,,]  are  now 
replaced  by  the  following  indicator  variables:  Sy*  =  1  if  the  triplet  (A",  Y',  Z')  from  which  X*  came  is 
such  that  F*_i  <  Y'  <  Y*  and  ZJ_|  <  Z'  <  Zl  and  =  0,  otherwise;  7',*  =  1  if  the  triplet 
{X',  Y',  Z')  from  which  K*  came  is  such  that  X*-,  <  X'  <  Xfand  Z*_i  <  Z'  <  Zl  and  Ty*  =  0,  oth¬ 
erwise;  t/yk  =  1  if  the  triplet  iX',  Y'.  Z')  from  which  ZJ  came  is  such  that  Xl_^  <  X'  <  XI  and 
F*-i  <  Y'  <  F*  and  t/y*  =  0,  otherwise.  The  rest  of  the  development  follows  the  two-dimensional 
case  almost  exactly.  The  necessary  modifications  of  the  conditions  (1.3)-(1.7)  are  easily  made  by  fol¬ 
lowing  the  computations  in  Lemma  2.5. 


Finally,  we  discuss  the  event  ND(n).  The  assumption  that  ND(n)  occurs  is  used  only  to  guaran¬ 
tee  that  the  equations  (1.1)  hold.  Since  these  equations  hold  by  construction  under  /;,  we  only  investi¬ 
gate  ND(n)  under/,. 

In  the  special  case  where  XI  and  F,'  are  independent,  Pj  lNDin)]  is  equal  to 

K  ^ 

,  and  this  approaches  one  as  n  increases  if  — —  approaches  zero  as  n  increases. 


+1 


In  cases  where  XI  and  F,'  are  not  independent,  we  proceed  as  follows.  Let  denote  the 

total  number  of  pairs  (T,',  F,'),  with  X;  <  AT*  and  F'  <  F^.  Define  S|(a,/3)  as  1  if  the  F'  originally 
associated  with  T*  falls  below  F^,  and  zero  otherwise;  define  S2(a./3)  as  1  if  the  F'  originally  associ¬ 
ated  with  T*  is  equal  to  F^,  and  as  0  otherwise;  define  Siia.fi)  as  1  if  the  F' associated  with  T*  is 
greater  than  F^,  and  zero  otherwise.  Define  as  1  if  the  Aforiginally  associated  with  fp  falls 

below  XI,  and  zero  otherwise,  define  7'2(/3.a)  as  1  if  the  A' originally  associated  with  FJ  is  equal  to 
X;,  and  zero  otherwise;  define  Tjifi.  a)  as  1  if  the  X’  originally  associated  with  F|  is  greater  than  T*. 
and  zero  otherwise.  Thus,  5|(a,  /3)  -F  S2(a,  /8)  -F  53(0:,  /3)  =  1,  a)  +  Tiip,  a)  +  a)  =  1, 

and  52(0,  /3)  =  a)  with  probability  one.  It  is  easy  to  write  the  joint  pdf  for  M„(a,  /3),  T*  FJ, 
5|(a,  /3),  Si^a,  /3),  53(0,  /3),  Fil/S,  a),  Ti(^.  a),  Tjip,  a),  by  using  the  multinomial  formula.  From 
this  joint  pdf,  and  the  fact  that  under  our  assumptions 


n 


F„{G(a-\.n). 


C(a-  1,«)|,  and  I  F^  -  //(/3-  l,n)l 


all  converge  stochastically  to  zero  as  n  increases,  uniformly  for  a,  0  =  1 . -F  1,  it  is  easy  to 


show  that  Pf  (S2(a,  j3'  =  1)  — 


./„(C(a  -\.n).H{l3-\.n) ) 
n  g(.a—  \.n)h(,p—  l.n) 


A„+l  A„+l 

a,j3=l . K„  +  \.  Thus,  (52(a./3)  =  1)  =  0 


nb‘ 


uniformly  for 


and  approaches  zero  as  n 


increases,  by  assumption  (1.6).  Since  the  event  |S2(a.  /3)  =  U  is  the  same  as  the  event  that  T*  and  F^ 
were  originally  observed  as  a  pair,  if  follows  from  (2.1)  that  (b/D(n))  approaches  one  as  n 

increases,  under  our  assumptions. 
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4.  A  TEST  OF  INDEPENDENCE 

Suppose  that  the  problem  is  to  test  the  hypothesis  that  X’  and  V,  are  independent.  Since  a  strictly 
monotonic  transformation  applied  to  each  XI  does  not  affect  independence  of  XI  and  Y'l  a  reasonable 
test  of  independence  should  not  depend  on  the  values  of  the  order  statistics.  We  will  construct  a  test  of 
independence  based  on  lAy). 


L„  1 

Choose  p„  =  \  -  q„  =  — .  Then,  if  the  hypothesis  of  independence  is  true,  p„(/.  j)  =  - - ^ 

w  ( Kfj  +  2) 

for  all  /  and  j.  ^  Also,  it  is  easily  shown  that  under  the  hypothesis  of  independence.  Variance 


= 


K„  +  \ 


K„  +  2 


IV,:.  Wn]  = 

'  '  (K„  +  2)^ 


.  Covariance  ( IV,y.  =  Covariance  { (V, 
I 


A„  +  1 


Covariance 


{K„  +  2)^ ' 

if  /■  ^  ('  and  j  f.  These  formulas  can  also  be  assumed  to  hold  if 


(K„  +  2yp„{i,  j)  approaches  one  as  n  increases,  uniformly  in  i  and  J,  and  all  asymptotic  probability  cal¬ 
culations  will  be  correct. 


Define  Q(«)  as  ^  ' - 


j-\  i-i 


yfTi 


and  note  that  Q(n)  is  observable.  Then, 


if  {K„  +  2rp„(.i,  j)  approaches  one  as  n  increases,  uniformly  in  i  and  j,  the  asymptotic  distribution  of 

r  1 

Q(n)  is  normal,  with  mean  {K„  +  \)^+n  ^  ^  [{K„  +  2)p„{i,  J)  -  |  and  variance 

j-i  i-\ 


K„  +  2 


2(K„  -t-  1)^  Then  we  have  the  following  test  of  independence  of  XI  and  K,';  Reject  the  hypothesis  if 

Q(n)  +  ^  where  «  is  the  desired  asymptotic  level  of  significance.  The 

^2(K„+\)^ 

asymptotic  power  of  this  test  is 


1  - 


$  -1(1  -  a)  - 


K„+2  A,„+2 

V2(A„ -1-1)2 


(K„  +  2)p„(i,  J)  - 


1 


A„-t-2 
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ABSTRACT 

We  consider  groups  of  tests  for  personnel  selection  purposes  in  which  each 
test  has  a  known  a  priori  probability  of  being  failed,  such  failure  resulting  in 
outright  rejection  and  termination  of  testing.  Each  test  has  a  fixed  cost  and 
given  duration.  We  consider  the  minimization  of  the  total  expected  cost  due  to 
both  the  fixed  costs  and  the  delay  costs  when  the  tests  may  be  conducted 
sequentially  or  in  parallel.  In  the  latter  situation,  a  heuristic  algorithm  is  pro¬ 
posed  and  illustrated. 


1.  INTRODUCTION  AND  MOTIVATION 

Consider  the  problem  of  determining  the  suitability  of  a  job  candidate  for  a  particular  position. 
The  personnel  manager  bases  his  Judgment  on  an  zt-dimensional  profile— each  dimension  being  a  score 
on  a  particular  aspect  of  relevance  to  the  candidate’s  acceptability.  Our  interest  is  in  determining  the 
optimal  order  of  assembling  this  profile  given  that  any  single  very  low  score  can  cause  the  candidate’s 
rejection  no  matter  what  the  nature  of  the  remainder  of  his  profile.  Moreover,  the  various  scores  are 
based  on  tests  which  are  assumed  to  be  independent  so  that  our  problem  differs  from  that  of  a  corre¬ 
lated  battery  of  tests,  all  of  which  are  used  to  predict  the  individual’s  success  (see,  for  example.  Cron- 
bach  and  Glazer  [3]).  Our  assumptions  are  reasonable  when  the  dimensions  represent  aspects  as 
diverse  as  health,  security  rating,  intelligence,  experience,  vocational  interest,  and  personality. 

For  each  test  there  is  a  predetermined  critical  score  and  a  candidate  whose  score  is  below  this 
level  is  rejected  outright  and  no  further  testing  is  required.  We  assume  that  the  proportion  of  the 
population  who  may  fail  in  this  way  is  known  for  each  test.  Conducting  each  test  involves  a  fixed  cost 
as  well  as  a  waiting  time  for  the  receiving  of  the  results.  The  object  is  to  arrange  the  tests  so  as  to 
minimize  the  expected  costs  of  testing  and  delay  in  reaching  a  decision.  The  decision  will  either  be: 

(i)  rejection  due  to  a  "failure,"  or 

(ii)  that  based  on  the  complete  profile. 

We  are  not  concerned  here  with  how  the  personnel  manager  is  to  relate  his  decision  to  the  candidate’s 
profile  in  cases  when  the  latter  passes  each  test— he  may  even  automatically  accept  all  such  candidates. 

It  is  our  belief  that  in  many  employment  decision  problems,  the  situation  described  above  is  more 
realistic  than  the  common  formulation  of  sequential  testing.  In  the  latter  case  the  suggestion  has  been 
to  place  "the  selection  device  that  has  the  highest  correlation  with  job  success  (validity)"  first  (Beach  [2] 
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p.  239).  This  approach  completely  ignores  the  role  costs  of  testing  must  play  in  determining  the 
optimal  sequence.  Moreover,  rather  extensive  research  has  to  be  done  on  the  correlations  between  the 
various  tests  and  the  success  criterion  for  each  different  job.  Our  approach  provides  a  set  ordering 
based  only  on  two  types  of  information— the  costs  associated  with  each  test  ant}  the  probability  of  fail¬ 
ing  each  test.  This  sort  of  information  may  often  be  available  from  general  sources  and  does  not 
require  new  experimental  information. 

We  also  consider  the  situation  were  one  does  not  necessarily  wait  for  the  results  from  a  completed 
test  before  conducting  a  new  one.  This  possibility  leads  us  into  the  realm  of  parallel  testing  procedures 
and  it  is  here  that  the  major  contributions  are  made  based  on  heuristic  algorithms. 

2.  SEQUENTIAL  TESTING 

The  personnel  manager  wishes  to  obtain  the  candidate’s  score  on  n  tests  Tj . r„  in  order  to 

consider  his  job  suitability.  Each  test  has  a  veto  failure  level:  a  score  below  this  level  (a  failure) 
automatically  precludes  the  applicant  from  taking  the  job.  In  the  interest  of  saving  time  one  would  sug¬ 
gest  starting  all  tests  immediately,  on  the  other  hand,  if  there  is  a  good  chance  of  a  candidate  failing  a 
particular  test  it  may  be  worthwhile  waiting  for  the  results  on  this  test  since  there  is  a  good  chance  of 
saving  the  fixed  costs  associated  with  further  tests. 

For  the  case  in  which  testing  can  only  be  performed  sequentially  we  formalize  the  discussion  by 
considering  the  following  assumptions; 

ASSUMPTION  1:  The  nature  of  the  tests  is  such  that  the  event  of  passing  a  particular  test  is 
independent  of  that  for  any  other  test. 

ASSUMPTION  2;  The  a  priori  probability  p,  that  the  candidate  may  fail  test  7^  is  known  for  each 
/=  1.  2 . «. 

ASSUMPTION  3:  The  cost  of  waiting  to  make  a  decision  based  on  the  test  results  is  y  units  per 
unit  time.  (Decision  is  made  according  to  (i)  or  (ii)  of  Section  1.) 

ASSUMPTION  4:  Tests  may  not  be  run  simultaneously,  nor  may  a  test  be  started  while  the 
results  on  another  are  pending. 

Assumption  2  involves  knowledge  of  the  probability  that  the  candidate  will  fail  each  of  the  tests. 
These  numbers  can  either  be  determined  from  general  population  proportions  or  from  proportions 
appropriate  to  the  subclass  to  which  the  applicant  belongs.  In  other  words,  after  a  preliminary  inter¬ 
view,  or  from  the  job  application  form,  the  personnel  manager  may  be  able  to  determine  to  which 
broad  category  the  candidate  belongs  and  thus  estimate  the  appropriate  failure  probabilities.  If  these 
probabilities  differ  from  category  to  category  then  so  may  the  optima)  sequencing  of  tests. 

Assumption  4  wilt  be  reviewed  in  Section  3. 

Denote  by  c,  the  fixed  cost  associated  with  conducting  lest  T,,  and  let  d,  be  the  duration  of  this 
test.  Duration,  in  some  contexts,  may  refer  to  the  actual  time  span  of  the  test  or,  it  may  refer  to  the 
waiting  time  for  the  results  to  be  obtained.  In  the  latter  case,  the  actual  physical  involvement  of  the 
candidate  in  the  test  will  be  considered  negligible  (see  Section  3)  as  far  as  time  is  concerned. 

Under  Assumption  4  we  can  associate  Che  cost 
Qi  =  c,  +  -y  d, 

with  test  T,,  since  there  is  no  parallel  testing. 
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RESULT  1:  Under  Assumptions  1,  2,  3  and  4,  the  optimal  testing  sequence  is  determined  by 
quantities  (a,/p,)— the  test  with  the  smallest  value  is  conducted  first  and  so  on. 

PROOF:  For  a  given  ordering,  say  T\,  T2,  ,  T„,  the  expected  cost  is  given  by 

(1)  £(0  =  01+  (1  -/7|)a2  +  (l~Pl)  (1  -P2)fl3  +  . . .  +  (1  -pi)  ...  (1  -  p„-|)o„. 

Suppose  we  change  the  order  of  tests  7}  and  7;+,.  The  relevant  contribution  to  E(C)  in  (1)  is 
(]  -p()  (1  -p2^  -  -  f  +  0  -p/)a,+j) 

whereas  under  the  new  order,  it  is 

(1-p,)  (I-P2)  (1-A-i)  (a+1  +  (l-A+i)fl,). 

the  other  terms  remaining  unchanged.  From  here  it  is  clear  that  T,  should  appear  before  7;+^  if 
a,/p,  <  fli+i/pi+i,  and  the  result  follows  straightforwardly. 

The  above  proof  is  a  simple  application  of  the  "adjacent  pairwise  interchange  method"  (see,  for 
example,  Baker  III). 

3.  PARALLEL  TESTING 

The  restriction  that  no  test  be  commenced  before  the  previous  results  are  known  is  an  artificial 
restriction  for  most  personnel  selection  situations.  If  time,  and  hence  money,  can  be  saved  by  running 
several  tests  simultaneously  then  this  option  should  also  be  investigated.  If  it  is  financially  important 
for  a  firm  to  fill  its  vacancy  (or  vacancies)  as  quickly  as  possible  (that  is,  y  is  large),  then  it  will  become 
more  cost  efficient  to  run  several  tests  in  parallel.  Depending  on  the  value  of  y  and  the  set  up  costs, 
this  does  not  necessarily  mean  that  all  the  tests  should  be  started  at  the  outset.  We  therefore  turn  to 
the  problem  of  approximating  the  optimal  testing  program. 

A.  The  Size  of  the  Problem 

Here  we  drop  Assumption  4  and  allow  several  tests  to  be  "run"  at  the  same  time— for  the  person¬ 
nel  selection  situation  a  test  is  "running"  as  long  as  its  result  is  still  pending— the  actual  active  part  of 
the  test,  as  far  as  the  candidate  is  concerned,  is  considered  negligible.  In  other  words,  we  consider 
situations  in  which  processing  and  making  the  results  available  to  the  employer  take  considerably  longer 
than  the  administration  of  the  tests  themselves.  People  involved  in  personnel  selection  are  well  aware 
of  these  procedural  delays.  To  get  some  idea  of  the  size  of  the  problem,  we  first  consider  how  many 
possible  arrangements  of  the  n  tests  exist  when  an  arbitrary  number  may  be  conducted  at  the  same 
time.  The  optimality  problem  is  significantly  simplified  by  the  observation  that  there  is  no  point  start¬ 
ing  a  particular  test  when  no  other  test  is  finishing— the  fixed  costs  being  incurred  in  any  case— and,  in 
the  interest  of  minimizing  delay,  one  might  as  well  advance  the  start  until  the  end  of  the  most  recently 
completed  test. 

The  various  possible  configurations  can  be  represented  by  trees.  For  example,  for  n  =  3  tests, 
consider  three  possible  configurations 
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The  first  configuration  (i)  indicates  that  test  Tx  is  started  first,  followed  simultaneously  by  Ti  and  Ty. 
W  e  can  use  the  length  of  lines  to  indicate  the  relative  durations  and  label  nodes  by  the  number  of  the 
test  just  completed,  using  0  to  indicate  the  time  origin.  The  above  configurations  are  then  represented 


The  number  of  such  trees  is  given  by  T(n)  =  (n+  l)'’^'  which  corresponds  to  the  number  of 
configurations  of  n  tests  (see  Harary,  [5]  chapter  11)  and  for  n  =  3  there  are  16  such  possible  arrange¬ 
ments. 

For  the  testing  situation  there  may  be  somewhat  fewer  possible  arrangements  if  the  "branch  dis¬ 
tance"  from  the  origin  is  the  same  for  two  different  nodes.  For  example,  if  d\  ~  dy  then 


1 


are  effectively  equivalent  as  far  as  deciding  when  to  start  Tj. 

To  obtain  a  lower  bound  A  (n)  on  the  number  of  effectively  different  configurations,  we  consider 
the  case  when  all  the  tests  have  the  same  duration.  It  is  not  hard  to  see  that  the  following  recurrence 
relation  holds: 

n-i  ( nl 
j-o  I 

Some  values  of  T(n)  and  A  (n)  are  given  below: 

"  12  3  4  5  6 

r(r)  1  3  16  125  1296  16807 

A(n)  1  3  13  75  541  4683. 

In  any  given  situation  with  n  tests  of  possibly  different  duration  times,  the  number  of  different  arrange¬ 
ments  will  be  between  the  two  numbers  A  (n)  and  T(n). 

Seeing  that  finding  the  optimal  arrangement  by  complete  enumeration  quickly  becomes  prohibi¬ 
tive,  we  now  suggest  some  heuristic  algorithms  to  provide  good  suboptimal  schedules.  The  efficiency 
of  the  algorithms  is  discussed  for  some  simple  examples— a  more  thorough  investigation,  possibly  using 
the  sampling  approach  of  Dannenbring  (4),  being  postponed  at  this  stage. 

B.  Single-Step  Check  Algorithm 

Using  the  notation  of  Section  2  we  describe  the  algorithm  as  follows: 

(I)  Order  the  tests  according  to  iajpi)  values  as  in  the  sequential  case.  Assume  the  resulting 
order  is  ( Ti .  T„). 
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(11)  Compare  the  expected  costs  up  to  the  completion  of  all  tests  T\,  ....  7",  when  T,  is 
started  at  the  last  node  of  the  current  configuration  of  (T\,  T2,  .  ..  7^-i),  with  that 
incurred  if  T,  were  started  simultaneously  with  the  most  recently  started  test.  Choose  the 
configuration  with  the  smaller  expected  cost  and  then  proceed  to  consider  Ti+\  similarly, 
until  all  n  tests  have  been  scheduled. 


The  number  of  comparisons  required  is  n  —  1;  a  typical  succession  of  steps  for  n  =  5  may  appear 
as  follows: 


0. - .  1 


(i) 


5 


Note  that  moving  from  stage  (ii)  to  (iii)  the  tree 


(3) 


3 


is  not  considered  according  to  (II)  above.  We  could  have  considered  the  single  step  procedure  which 
would  at  each  stage  compare  trees  such  as  (3)  with  that  chosen  (iii).  However,  this  latter  algorithm  has 
the  disadvantage  of  not  being  able  to  potentially  produce  both  the  purely  sequential  as  well  as  the  com¬ 
pletely  simultaneous  schedules  (unless  all  tests  are  of  the  same  duration).  It  seems  more  important  to 
consider  single-step  check  procedures  that  can  take  in  both  extremes  as  against  those  that  do  not  skip 
possible  starting  nodes  (such  as  occurs  when  (3)  is  missed).  If  the  durations  d,  are  all  equal  (say  1 
unit)  then  the  above  mentioned  variations  of  (II)  are  equivalent. 

In  fact,  the  required  comparisons  can  be  made  quite  simply  when  <7,  =  1,  for  all  /.  Note  that  since 
the  tests  are  run  in  parallel  and  4  =  only  the  fixed  cost,  c,,  of  the  test  need  be  considered.  Without 
loss  of  generality  we  take  k  =  1  here  and  in  the  sequel.  At  stage  i  we  have  typically 

t-l 
i-2 

i-k 

T+\ 


where  r,  7  -F  1  denote  units  of  time  from  the  origin.  Suppose  the  probability  of  reaching  t  is  denoted 
by  (1  -  77  J.  If  7"^  is  started  at  7  then  the  relevant  contribution  to  the  expected  cost  up  to  time  7  -F  1  is 

(4)  (1  ~  77,.)  (c;-*  -F  . . .  -F  C/_|  -F  c,  -F  1) 
whereas  if  7^  is  started  at  time  7  -F  1,  it  is 

(5)  (1  —  TtJ  (Cy_/^  -F  .  .  .  C,..]  -F  1)  -F  (1  —  77r+|)  (c,  -F  1). 

Comparing  (4)  and  (5)  we  decide  to  start  Tf  at 


(6) 


^  (1-77,+  ,) 

7  if  C,  <  — ; -  (C,  -F  1) 

I  TT^ 

7-FI  if  c,  >  (£■  -F  1) 

1  7T  - 
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1  —  ir^+i 

where,  clearly,  — ; - =  (1  -  ...  (l-p,_|). 

1  -  TTj 

For  this  situation  we  see  that  the  comparisons  can  be  made  quickly  and  sequentially  with  a 
minimum  of  data  to  be  stored. 


The  systematic  application  of  the  algorithm  when  the  4  are  different  will  now  be  described.  Sup¬ 
pose  we  are  at  stage  /  and  wish  to  determine  where  to  start  test  7}.  Suppose  that  X  denotes  the  time  of 
the  last  node  so  far  and  i>  the  time  of  the  most  recent  start  (i.e.,  the  time  at  which  7',_|  is  commenced). 
Define  the  following  quantities: 

Q  —  expected  cost  up  to  time  A  (7]  excluded); 

—  expected  cost  up  to  time  i/; 
r  —  number  of  tests  commencing  at  i/  (including  T,)  ; 

s  —  number  of  tests  commencing  or  in  progress  at  time  i/; 

ri.  i'2 . ~  indices  of  tesu  commencing  or  in  progress  at  time  i^: 

/?i  <  /?2  <  . . .  <  /?s  —  time  left  to  completion  of  tests  i^i . i',,  respectively; 

r 

^  —  fixed  costs  of  starts  at  time 

2-1 

1  -  TT^  —  probability  of  getting  through  all  tests  up  until  time  v. 


Note  that  the  index  /  is  among  the  indices  . 


(7) 


Now  if  Tj  is  started  at  time  A  then  the  new  expected  cost  is 


n  (1  - 


2-1 


(Cj  -t-  4) 


with  the  remainder  of  the  updating  as  follows: 

A'  =  A  -E  4,  *'’  =  “  ^^x>  r'  =  s'  =  2,  (i2|',  1^2)  =  1  +  1). 

R{=  min  (4,  4+i)-  ^2=  max  (4,  d,+\),  A"=  q  c,+|' 

1  -  =  n  “  4)- 

2-1 


If  T, 
(8) 


is  started  at  time  A  then  the  new  expected  cost  is 


Q'  =  C„  (1  -TT^)  (7f  -t-  -E  (l-p„|)/’.j  7?2  +  •  ■  •  +  (1  -Pr,)  •  •  •  (1 


with  the  appropriate  updating  determined  by: 


A'  =  r  -E  7?,,  v'  =  12,  Q'  =  C^,  r'  =  r  -E  1,  s'  =  s  -E  1,  K'  =  K  +  c,+i,  ttJ  = 

and  4+1  being  inserted  into  its  correct  place  among  R\,  ....  R^  to  give  the  new  indices  p\ . Vi'+i 

and  residual  times  T?/,  . . .  , 


At  each  stage  i  the  smaller  of  (7)  and  (8)  determines  the  appropriate  scheduling  of  7",  and  the 
appropriate  updated  quantities  to  proceed  to  the  next  stage. 
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C.  Multistep  Check  Algorithms 

At  the  expense  of  considerably  increased  computational  effort  the  single-step  check  algorithm  can 
be  extended  by  considering  starting  test  7]  at  several  nodes  in  the  existing  configuration  at  stage  /. 
There  are  at  most  i  possible  starting  nodes  so  that  if  each  one  of  these  was  investigated  at  each  stage  /, 
at  most  n(n  +  l)/2  different  configurations  would  be  considered. 

For  example,  one  might  start  by  considering  a  two-step  check  procedure  which  would  involve 
comparing  the  merit  of  starting,  at  stage  /,  7^  at  either  the  last  node  or  one  of  the  last  two  starting  times. 
The  algorithm  would  require  storing  considerably  more  information  at  each  stage.  However,  there  is  no 
guarantee  that  the  final  schedule  will  be  superior  to  that  produced  by  the  single-step  check  procedure. 

Indeed,  due  to  the  serious  interactive  effect  of  parallel  tests  on  the  cost  structure,  it  is  not  possible 
to  conclude  that  any  multistep  procedure  will  always  do  better  than  a  one-step  procedure.  This  is 
because  on  adding  7'„,  we  may  be  better  off  with  some  7)  starting  at  a  node  determined  by  the  one-step 
procedure  than  at  the  node  to  which  it  was  allocated  at  stage  /  by  the  multistep  procedure. 

We  therefore  do  not  look  further  at  the  multistep  check  algorithms  in  this  paper,  although  their 
accurate  description  still  poses  an  interesting  and  possibly  important  problem. 


4.  NUMERICAL  EXAMPLES 
EXAMPLE  1. 

Data:  u  =  3 


/ 

c, 

d, 

Pi 

a, 

oJPi 

1“ 

1 

1 

.25 

2 

8 

2 

8 

1 

.7 

9 

90/7 

3 

2 

1 

.2 

3 

15 

ALGORITHM:  We  note  that  4  =  1;  /  =  1,  2,  3  so  that  we  may  use  the  criterion  (6)  at  each 

stage. 

(1)  Starting  order  is  1,  2,  3. 

(2)  Start  7'2  at  0  if  C2  <  (1  -  P|)  (c2  +  D  =  (.75)  (9)  =  6.75. 

Hence,  we  start  Ti  at  time  1  and  the  configuration  stands  as 

0  I  2 

(3)  Start  T'j  at  node  1  if  C3  <  (I—  P2)  (C3-I- 1)  =  (.3)  (3)  =  .9. 

Hence,  we  start  T}  at  node  2  and  the  final  configuration  is 

0  1  2  3 


with  total  expected  cost  9.425 
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The  optimal  configuration  is  actually 


with  a  total  expected  cost  of  9.4.  We  note  that  a  two-step  check  procedure  would  have  given  us  this 
configuration  but  that  the  one-step  algorithm  gives  a  very  nearly  optimal  schedule.  Here  we  also  have 
an  example  of  a  case  for  which  the  optimal  parallel  testing  schedule  involves  inversions  of  the  testing 
order  for  a  sequential  schedule  (viz.  starting  T2  on  the  completion  of  T3). 

EXAMPLE  2. 

Data:  «  —  3 


i 

4 

Pi 

a, 

aJPi 

T 

1 

i 

.3 

2 

20/3 

2 

4 

3 

.6 

7 

35/3 

3 

1 

2 

.2 

3 

15 

ALGORITHM:  We  use  the  algorithm  outlined  by  equations  (7)  and  (8).  The  initial  order  is 
a^Tain  1,  2,  3,  so  that  stage  (1)  consists  of  starting  Tj  at  0.  We  now  tabulate  the  relevant  quantities 
defined  in  the  previous  section  for  stages  (2)  and  (3),  explaining  the  calculations  below. 

\  K  r  s  {vj)  |/?y)  K  l-TT^  C  Q 

O)  I  0  2  2  Tl  Tl  5  i  0  2~ 

(3)  4  1  2  2  3, 2  2,  3  5  .7  2  6.9 

Stage  2:  According  to  (7)  Q'  =  2  -t-  (.7)7  =  6.9 

According  to  (8)  Q'  =  0  -I-  (1)  |5  +  (.3)  1  -I-  .7(3))  =  7.4 

We  therefore  update  according  to  equation  (7). 

Stage  3:  According  to  (7)  Q  =  6.9  -1-  (.7)  (.4)  (3)  =  7.74 

According  to  (8)  Q'  =  2  -t-  (.7)  (5  -1-  (.2)  2-1-  (.8)  3)  =  7.46 

We  do  not  need  to  update  the  quantities  further.  The  tree  has  developed  as  follows: 


0  10  12 


0 


2 


(1) 


(2) 


(3) 


The  algorithm  does  produce  the  optimal  configuration  in  this  case,  with  total  expected  cost  7.46. 
CONCLUSION 

We  have  proposed  an  approach  to  personnel  test  sequencing  which  is  based  on  cost  effectiveness 
rather  than  predictive  ability.  Once  a  company  has  decided  on  which  tests  are  required  to  determine  job 
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suitability,  then  their  application  may  be  done  more  efficiently  by  considering  the  sequencing  algorithms 
discussed  above. 

It  is  clear  that  a  major  area  of  application  lies  in  recruiting  foi  specialized  jobs  in  the  government 
or  the  armed  forces.  Here,  investigating  the  various  facets  such  as  security,  health,  intelligence  and 
personality  typically  involve  high  cost  and  delays.  Moreover,  information  is  commonly  available  on  the 
a  priori  probability  that  an  applicant  will  meet  the  required  standards  in  each  area.  We  suggest  that  the 
ideas  presented  here  will  help  make  the  screening  of  applicants  more  efficient,  either  by  implementing 
the  algorithms  herein  described  or  by  encouraging  the  development  of  related  ones. 
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ABSTRACT 

We  derive  formulas  for  ihe  variance  of  lhai  proportion  of  the  value  of  a  ran¬ 
domly  locaied,  circularly  symmetric  area  target  that  is  destroyed  by  N  indepen¬ 
dently  fired  weapons  of  identical  type  whose  damage  functions  are  circularly 
symmetric  about  the  respective  impact  points.  The  probability  density  func¬ 
tions  of  the  target  center  location  and  of  the  weapon  impact  points  are  also  cir¬ 
cularly  symmetric.  The  general  results  are  specialized  to  uniform  and  Gaussian 
functions.  In  the  latter  case  a  closed-form  solution  (triple  integral)  for  Ihe  vari¬ 
ance  of  Ihe  coverage  is  derived.  Similar  to  some  well-known  results  on  expect¬ 
ed  coverage,  this  expression  for  the  variance  of  the  coverage  can  be  easily 
evaluated  by  numerical  quadrature.  Numerical  results  are  given  which  indicate 
the  target  coverage  variability  caused  by  the  combined  effects  of  random 
target-locating  errors  and  weapon  impact  point  fluctuations. 


INTRODUCTION 

Two-dimensional  coverage  problems  arise  frequently  when  assessing  the  effectiveness  of  artillery 
or  air-to-ground  weapons.  The  surveying  papers  of  Eckler  [5],  Eckler  and  Burr  [6],  Guenther  and  Ter- 
ragno  (I6l  show  that  there  are  many  papers  concerned  with  the  expected  coverage  of  an  area  target  by 
multiple  independently  fired  weapons.  This  reflects  the  importance  of  knowing  the  average 
effectiveness  of  weapon  systems.  However,  in  military  requirements  another  criterion  is  often  used: 
The  measure  of  effectiveness  for  a  salvo  of  N  rounds  is  the  proportion  e  of  the  total  value  of  an  area 
target  that  is  destroyed  with  a  probability  of  at  least  y.  Expected  coverage,  obviously,  does  not  supply 
such  information,  even  though,  of  course,  a  higher  expectation  of  destroyed  value  generally  increases 
the  confidence  level  y  for  any  given  e  (or  vice  versa).  What  would  be  required  in  the  case  of  an  e-y- 
criterion  is  not  only  the  expectation  of  the  destroyed  value  or  coverage,  but  the  distribution  of  the  cov¬ 
erage.  In  sharp  contrast  to  the  bulk  of  papers  on  expected  coverage,  only  very  little  literature  is  avail¬ 
able  on  the  distribution  problem.  Domb  (4],  Siegel  |34]  and  Votaw  [38]  have  calculated  distributions 
of  the  coverage  for  arbitrary  numbers  of  covering  objects  .V  ^  1;  they,  however,  treat  only  one¬ 
dimensional  coverage  problems  (line  segments  and  circular  arcs),  and  their  methods  do  not  seem  appli¬ 
cable  to  area  problems.  Solomon  [35]  derives  in  two  dimensions  Ihe  distribution  of  the  destroyed  value 
of  an  area  target  by  one  weapon  only  and  gives  some  upper  and  lower  bounds  for  y  (given  t)  in  the 
case  of  two  weapons.  Several  authors  have  derived  expressions  for  the  higher  moments  (especially  the 
second  moment)  of  that  fraction  of  a  fixed  geometrical  target  figure  or  body  which  is  covered  by  A'  ran¬ 
domly  thrown  figures  or  bodies  (Ailam  HI,  Bronowski  and  Neyman  |2],  Garwood  Il3l,  Greenberg 
[15],  Moran  and  Fazekas  de  St.  Groth  124),  Robbins  [25]  and  [26],  Santalo  (271).  The  principal  result 
obtained  in  this  connection  is  due  to  Robbins  (25).  Others  (Cooke  [3],  Eden.'  (7),  Flatto  (8),  Flatto 
and  Konheim  [9],  Flatto  and  Newman  llO),  Gilbert  [14],  Holst  [18]  and  [19],  Kapian  [20]  and  [21], 
Miles  [23],  Shepp  [32],  Siegel  [33],  Steutel  [36],  Stevens  [37])  estimate  the  probability  ’hat  the 
randomly  thrown  figures  cover  the  fixed  target  figure  completely,  or  estimate  the  number  of  figures 
required  to  cover  Ihe  target  figure  completely. 
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From  all  these  results,  those  on  the  second  moment  of  the  coverage  in  the  space  of  two  or  more 
dimensions  are  the  most  interesting  with  respect  to  weapon  analyses.  Although  the  second  moment  (or 
the  variance)  does  not  supply  the  precise  information  that  is  required  for  the  e-y -criterion,  it  is 
nevertheless  useful,  since  Tchebyshev's  inequality  can  in  some  cases  yield  a  lower  bound  for «  (given 
■y)  or  y  (given  e).  The  papers  cited,  however,  are  confined  to  one  situation;  this  limitation  results 
partly  from  modeling  the  coverage  problem  as  an  overlap-of-geometrical-figures  problem.  In  weapon 
analyses  terms,  it  is;  (1)  The  only  targets  considered  are  those  with  sharp  edges  and  uniform  value  den¬ 
sity.  (2)  The  only  weapons  (randomly  thrown  figures)  are  those  with  zero-one  damage  functions;  com¬ 
plete  destruction  occurs  within  the  weapon  effects  area,  and  no  destruction  without.  (3)  The  weapon 
impact  points  are  assumed  to  be  distributed  uniformly  within  a  specified  area.  For  most  applications, 
these  three  assumptions  could  be  accepted  as  more  or  less  good  approximations  of  the  real  world. 
However,  one  limitation  is  a  serious  drawback,  which  reduces  the  usefulness  of  the  former  results  on 
the  variance  of  the  coverage;  (4)  The  target  position  is  not  a  random  variable,  or  in  our  words;  The  tar¬ 
get  location  error  is  zero  with  probability  one. 

It  is  the  objective  of  this  paper  to  derive  expressions  for  the  variance  of  the  coverage  in  more  gen¬ 
eral  situations  so  that  limitations  (1)  to  (4)  are  eliminated.  First,  we  generalize  Robbins's  [25]  main 
result  on  the  moments  of  the  coverage  of  geometrical  figures  to  randomly  located  targets  with  general 
value  density  function.  Second,  we  consider  a  coverage  problem  in  which  all  underlying  functions  are 
Gaussian,  and  derive  a  closed-form  solution  (triple  integral)  for  the  variance  of  the  coverage.  In  this 
Gaussian  case  the  standard  deviation  of  the  coverage  can  be  determined  in  little  computer  time  by 
numerical  quadrature  so  that  a  parametric  analysis  can  be  made.  Numerical  results  will  be  presented 
which  clearly  indicate  the  variability  of  target  coverage  caused  by  the  combined  effects  of  random 
weapon  impact  point  and  target  location  fluctuations.  As  a  side  result,  we  generalize  a  formula  of  Gar¬ 
wood  [13]  and  Santalo  [27]  for  the  variance  of  the  coverage  of  one  circle  by  A  other  circles. 

The  concepts  of  the  Gaussian  target  and  the  Gaussian  damage  function  have  been  considered  in 
many  papers  (see,  e.g.,  Fraser  [1 1]  and  [12],  Eckler  [5],  Eckler  and  Burr  [6]),  dealing  with  problems  on 
expected  coverage;  the  main  advantage  of  this  model— beyond  its  intuitive  appeal  as  an  approximation 
of  real  world  situations— is  that  many  problem  solutions  can  be  given  in  closed  form  which  is  not  possi¬ 
ble  when  corresponding  problems  with  other  functions  are  considered.  This  same  property  will  ulti¬ 
mately  be  used  to  derive  the  triple  integral  expression  for  the  variance  of  the  coverage  of  the  normally 
located  Gaussian  target  by  warheads  with  Gaussian  damage  functions  and  normally  distributed  impact 
points. 

1.  THE  COVERAGE  PROBLEM 
Target  Value  and  Location  Distribution 

We  consider— on  an  x-y-plane,  where  x,  y  are  Cartesian  coordinates— a  circularly  symmetric  area 
target  whose  precise  position  is  not  known  because  of  a  random  locating  error  (see  Figure  1).  We 
assume  that  the  probability  density  function  (p.d.f.)  f/^  ixr.  of  the  coordinates  of  its  center  is 
known,  and  that  (xj.  yj)  is  circularly  symmetric  with  respect  to  the  origin  O  of  the  x-y-coordinale 
system,  so  that  (xf,  yj)  =  f\(xT  +  The  point  O  is  the  expected  target  center  location. 

The  area  target  value  need  not  have  a  uniform  density;  for  many  applications  it  has  proven  useful 
to  consider  the  concept  of  a  nonuniform  target  value  density  function  (u,  v)  as  done,  for  example, 
by  Eckler  [5],  Eckler  and  Burr  [6],  Helgert  [17],  McNolty  [22],  Schroeter  [28],  [29],  [30],  [31],  and 
Washburn  [39].  The  function  h'^(m.  v)  need  not  be  greater  than  or  equal  to  zero,  although  in  the 
usual  applications  it  is.  Our  following  results  will  apply  to  any  value  density  function  that  is  absolutely 
integrable  over  the  u-v-plane,  and  circularly  symmetric  with  respect  to  the  center  T  of  the  target,  so 
that  v)  =  w((w^  -I-  v^)'^^]  within  a  u-v-coordinate  system  whose  origin  is  at  T. 
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Fjgore  I  Coverage  of  circular  target  by  4  “cookie -cutter*  rounds 


P.D.F.  of  Impact  Points  of  Weapons 

In  the  following  we  consider  a  salvo  of  N  identical  rounds.  We  assume  that  the  weapons  are 
aimed  at  the  origin  O  of  the  jc-j'-plane,  the  expected  location  of  the  target  center.  (Aiming  all  weapons 
at  that  point  will  not  necessarily  maximize  the  expected  target  value  destroyed.)  Because  of  random 

delivery  errors,  however,  the  weapons  will  not  impact  at  0,  but  at  points  y^),  J  =  1 . .V, 

more  or  less  dispersed  around  O.  We  consider  the  set  of  impact  points  as  a  sample,  randomly  and 
independently  drawn  from  a  population  with  the  known  p.d.f.  k^ix,  y)  which  is  circularly  symmetric 
and  centered  at  O,  so  that  k^ix,  y)  =  /c[(x^  +  y^)'^^]. 

Damage  Function  of  Weapon  and  Coverage  of  Area  Target 

We  introduce  a  Cartesian  4^-17 -coordinate  system  whose  origin  is  at  the  impact  point  of  a  weapon. 
We  assume  that  the  effectiveness  of  the  weapon  can  be  adequately  described  by  a  circularly  symmetric 
damage  function  r))  =  -I-  with  0  <  tj)  ^  1.  This  function  has  the  following 

interpretation:  The  value  of  that  element  of  an  area  target  which  lies  at  the  point  with  the  coordinates 
i,  T)  is  reduced  by  the  factor  1  —  d^(i.  17).  This  implies  that  the  value  density  of  the  target  remaining 
after  the  bombardment  by  the  weapon  is  v)  =  v)  [1  —  d^iu  -  v  -  17^)],  wherein 

Vt  th®  target  center  coordinates  in  the  ^  —  vj-coordinate  system,  or  the  weapon  impact  point 
coordinates  in  the  u-v-coordinate  system  whose  origin  is  at  the  target  center.  (This  property  also 
applies  to  any  noncircularly  symmetric  functions  and  d^,  if  only  the  axes  of  the  tt-v-and  the 
coordinate  systems  are  respectively  parallel.  The  superscript  in  parentheses  indicates  the  number  of 
weapons  considered.)  Hence,  the  total  destroyed  value  (or  coverage)  177-)  is  the  following 

functional  of  and  d^: 

(1)  T),-)  =  J*  ^  —  ?7-,  v  -  y)j)dudy 

OD  00 

=  /_„  11  -  T)r)rf^(^,  j\)d^di). 

This  is  the  two-dimensional  convolution  of  the  two  functions  w^(u.  v)  and  -17),  or 

w^{-u.  — v)  and  d^(^.  tj). 
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If  both,  and  are  the  "cookie-cutter"  functions  for  the  two  geometric  figures  W  and 
respectively,  (i.e.,  when  they  equal  one  within  W  and  respectively,  and  zero  without),  then 
is  the  intersecting  area  of  and  3!  This  is,  of  course,  the  usual  definition  of  the  cover¬ 
age  of  one  geometric  figure  by  another.  Using  (I),  one  can  immediately  generalize  this  concept  to 
more  general  targets  and  damage  functions,  and  this  definition  has  been  applied  in  all  papers  where 
nonuniformly-valued  area  targets  have  been  considered. 

Coverage  of  Area  Target  by  Multiple  Weapons 

Remember  that  the  target  center  and  weapon  impact  points  coordinates  in  the  jf->’-coordinate  sys¬ 
tem  are  Xf.  v/,  Jf|,  ><1 . .vv,  v.v,  respectively.  The  coverage  of  the  target  by  all  the  N  weapons  then 

is  the  following  function  (cf.  Schroeter  [31],  Equation  (6)): 

(2a)  yr.  -"fi.  3'i . -v.v.  y\  )  =  A  -  J"  ^  J"  (.v  -  Xj,  - yr'l  ~  ~  ^j’  y-yj)^dxdy, 

/QO  ^  oo  /• 

J  •♦ulw.  v)duJv  ^  Define 

C  =  c*'’  and  denote  the  not-destroyed  value  K-C  by  C.  (The  complemejii  of  the  destroyed  value  or 
coverage  is  sometimes  termed  "vacancy"  (Ailam  [1],  Holst  [18])).  Define  d^ii.  r))  =  dl((~  ■+■  tj^)''^]  = 

1  “  V^-  Then 

(2b)  C(xj-,  yr,  j:|.  yi . x^^,  T,v)=J_^J_^  w^(x  —  Xf.  y  -  yj)  f[r/4(.v  — x,.  y~yi)dxdy. 

/-I 

Considering  the  targe^center  and  weapon  impact  point  coordinates  as  random  variables,  we  will  deter¬ 
mine  the  variance  of  C,  which  is  the  same  as  the  variance  of  C. 

Coverage  of  Area  Target  and  Probability  of  Killing  a  Point  Target 

In  this  paper,  we  will  not  go  deeper  into  the  analysis  of  point  targets,  It  is,  however,  interesting 
to  relate  the  concept  of  the  coverage  of  an  area  target  with  the  probability  of  killing  a  point  target,  since 
this  will  enable  us  to  generalize,  in  Section  2,  a  theorem  of  Robbins  [25]  on  the  relation  between  the 
moments  of  the  not-destroyed  value  of  the  area  target  and  the  probability  of  killing  none  of  several 
point  targets.  This  theorem  has  been  applied  by  several  authors  to  calculating  the  second  moment  of 
the  coverage  (as  done,  for  example,  by  Garwood  [13]). 

Consider  a  point  target  which  will  either  survive  an  attack  by  a  weapon  or  be  killed.  Define 
d^  (f ,  r))  to  be  the  conditional  probability  of  killing  the  point  target  provided  it  has  the  coordinates  tj 
in  the  coordinate  system  whose  origin  is  at  the  weapon  impact  point.  If  the  point  target  is  located  ran¬ 
domly  round  the  point  Ti^r'  according  to  the  probability  density  function  w^{u.  v),  then 
C*'’(^7-  ■’Jr)  as  defined  by  (I)  is  the  unconditional  probability  of  killing  the  point  target.  (This  proba¬ 
bility  is  unconditional  with  respect  to  the  random  location  of  the  point  target  round  T,  it  is  a  conditional 
probability  with  respect  to  the  location  of  T  itself.)  Evidently.  =  C  as  defined  by  (2a)  equally 
yields  the  probability  of  the  randomly  located  point  target  being  killed  by  at  least  one  round  of  the 
salvo.  Hence,  the  concepts  of  coverage  of  area  target  and  probability  of  kill  of  point  target  are 
equivalent,  when  d^  is  interpreted  in  the  dual  form,  as  done  above,  against  point  and  area  targets. 

Also,  when  xt.  yr-  X],  yi . Xj^,  y^  are  considered  as  random  variables,  then  the  expectation  of 

the  destroyed  value  of  the  area  target  equals  the  totally  unconditional  probability  of  killing  the  point  tar¬ 
get.  This  equivalence  is  well-known  (see,  e.g.,  McNoliy  [22],  Schroeter  [28]  and  [31]). 

I  am  indebted  to  the  referee  for  having  pointed  out  the  following  interpretation:  Let  X  denote  an 
indicator  variable  for  the  kill  of  the  point  target,  i.e.,  T  =  1,  whenever  the  point  target  is  killed,  and 
X  =  0  otherwise.  Again,  let  v)  be  a  probability  density  function.  Then  C*'”’  =  C  from  (1)  and 

(2a)  equals  the  conditional  expectation  of  X.  Similarly,  equals  the  unconditional  expectation  of  X^ 
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Ex-  Since  X  is  either  0  or  1,  its  distribution  is  completely  known  whenever  Ex  is.  Especially,  the  vari¬ 
ance  of  X,  o-J,  equals  £^(1  -  Ex).  Thus,  nothing  would  remain  to  say  on  the  variance  of  the  coverage 
if  such  were  the  problem  to  be  analyzed  in  the  present  paper.  Instead,  what  we  are  addressing  could  be 
described,  somewhat  loosely,  as  a  target  consisting  of  an  infinite  number  of  point  targets,  their  fre¬ 
quency  round  T  being  proportionate  to  the  density  function  v),  and  from  (I)  and  (2a) 

denoting  the  fraction  killed.  The  variance  of  that  fraction,  <rc,  generally  is  less  than  Ef-d  -  Ec). 

2.  MOMENTS  OF  NOT-DESTROYED  VALUE 


The  mth  Moment  of  the  Not-Destroyed  Value 


The  mth  moment  of  the  not-destroyed  value  C  is 
(3a) 


XOO  /*  ^  ^00^00 

...  \  \  J  yvxix  - 

-oo  ipr  _oo  •/ —  oo  •/— oe> 


Xr.  il  ~  y  ~ 


/a  (xt.  yr)  n  yj)dxrdyrdXidyi  . . .  dx/^dy,^. 


We  write  the  mth  power  of  the  inner  double  integral  with  respect  to  x,  y  as  a  repeated  2m-fold  integral 
and  interchange  the  order  of  the  integrations.  This  is  allowed  since  the  integrand  is  an  absolutely 
integrable  function  of  each  of  its  arguments.  Thus,  we  obtain 


^  oo  /*  /W  N  fft 

“  J-oo  ■  •  •  J-oc  y?)  n  V,  -  yr)  n  ^'1  3'P  n  ~  ~  yj'> 


I- 1 


J-1 


(-1 


dxxdy\  ...  dxNdysdxrdyrd^xdrn  ...  d^„diq„. 


The  inner  repeated  integral  with  respect  to  xi,  yj,  ...  ,  x\,  ys  may  now  be  written  as  the  Mh  power 
of  a  double  integral,  so  that 


(3b) 


yr)  n  ^  -  XT,  Vi  -  yr) 
»—  1 


v)  T}j-T))</^rfT7 


i-i 


N 

dxrdyrd^xdTXi  ...  d^„dVm- 


m 

The  integral  of  ~  Xj,  Vi  ~  yr^  with  respect  to  Xj,  y-f  is  completely  analogous  to 

/-I 
m  _ 

the  integral  of  v)  V,  ~  v)  with  respect  to  (,  r).  This  suggests  the  following  func- 

/-I 

tional  be  defined: 


<4)  171 . im,  Vm'^A-  ’  v)  d^  ,  T),  -  T))  df  ^7}, 

1-1 

which  implies 

/oo  ^  oo  rn  ^ 

.o  J-co  yr)  n  '*’A((i-xr,  Vi  -  yr)dxrdyr  =  Q„(^i,  Vi . V^^/a,  w^)- 

1-1 

Using  (4)  and  (5)  we  obtain  for  the  mth  moment  of  the  not-destroyed  value  as  an  alternative  form  of 
(3b): 

(3c)  . (m,Vm-,fA, 

V) . fm.  Vm’^  k^.  dx)d^xd7)x  ...  di„dr)„. 
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This  shows  that  is  an  integral  of  a  product,  whose  second  factor  is  independent  of  the  target  value 
density  function  and  the  target  location  error  p.d.f.;  this  factor  contains  all  information  on  the  weapons 
and  on  the  p.d.f.  of  their  impact  points.  The  first  factor  contains  all  information  on  the  target  and  its 
location  error,  and  is  independent  of  damage  functions  and  impact  point  p.d.f.’s. 

The  usefulness  of  (3b)  in  contradistinction  to  (3a)  will  become  apparent  when  we  consider  the 
second  moment,  at  the  same  time  taking  into  account  that,  in  real  situations,  the  number  of  weapons  N 
fired  in  a  salvo  generally  is  much  greater  than  2.  Both,  in  (3a)  and  in  (3b),  the  inner  integrals  are  dou¬ 
ble  integrals;  the  outer  integrals  are  (2N  +  2)-fold  in  (3a)  and  (2m  -1-  2)-fold  in  (3b).  For  N  >  m, 
therefore,  a  reduction  of  the  number  of  integration  processes  is  possible  by  using  (3b)  instead  of  (3a). 

Interpretation  of  the  Functional  Q„ 

For  m  >  1.  Q„  does  not  satisfy  the  commutative  and  associative  laws  for  its  argument  functions, 
such  as  the  ordinary  convolution  does  It  has,  however,  a  statistjeal  interpretation  similar  to  that  of  the 
convolution:  Let  w^(f.  tj)  and  f^ix,  y)  be  two  p.d.f.’s.  Then  0m(f|,  ni-  •  ••  -  'hml  /a^  **'^)  is  the 
joint  p.d.f  of  the  coordinates  t\,  of  m  points  fi,.  >=1,  ...  ,  m,  which  are  located  randomly  and 
independently  according  to  the  p.d.f  tv^  around  a  common  center  T  which,  for  its  part,  is  drawn  ran¬ 
domly  from  the  x-y-plane  according  to  the  p.d.f  /^(x,  y).  Another  interpretation  results  from  (2b); 
evidently 

C(xt.  yr.  X|,  . . .  y^)  =  Qs^xr  -  x^,  yr-  Ti-  •••  .  Xt  -  x,^,  yr  -  y^, 


Robbins's  Theorem 

An  examination  of  (4)  will  show  that  0„(f|,  m.  •  •  •  •  d^)  is  the  probability  of  killing 

none  of  m  different  point  targets,  located  at  the  points  S/(^,.  tj,),  by  a  single  round  whose  datmge  func¬ 
tion  is  and  whose  impact  point  p.d.f  is  k^(^,  if).  Hence,  t)1'  •  ••  •  fm<  Vm-  ^a<  d^)  is  the 

probability  of  killing  none  of  these  point  targets  by  a  salvo  of  N  identical  and  independent  rounds. 

Interpreting,  as  in  the  section  before,  Vi . Vm'>  as  a  joint  p.d.f,  we  see  from 

(3c)  that  the  mth  moment  of  the  not-destroyed  value  of  the  area  target,  equals  the  unconditional 
probability  of  killing  none  of  m  randomly  located  point  targets— which  are  distributed  according  to  the 
p.d.f  round  the  common  randomly  located  center  T— with  the  salvo  of  N  rounds.  This  is  the  gen¬ 
eralization  of  Robbins's  (25)  result  on  the  moments  of  the  coverage  of  a  fixed  geometrical  figure  by 
randomly  thrown  figures  to  the  coverage  of  a  randomly  located  target  with  arbitrary  value  density  func¬ 
tion  by  weapons  with  general  damage  functions.  Evidently,  these  results  equally  apply  to  coverage 
problems  where  the  underlying  functions  are  not  circularly  symmetric. 

Expressions  for  the  Second  Moment 

Because  of  the  assumed  circular  symmetry  of  d^,  and  the  integrals 

i.  V\.  fj-  172;  /a-  /a^^-  T7)'*’^(^I  -  T)1  -  TI2  -  T|)rffrfT), 

Q2^^\-  V\.  ^2.  V2\  kA‘  dA)  -  ^  ^  ^  k^(f.  V^d^^^x  -  V\~  V^d^((2  “  V2  -  v'^d^dy} 

depend  only  on  the  relative  positions  of  the  three  points  0(0,  0),  fii(f|,  tji),  and  ^2(^2  172)-  There¬ 
fore  we  may,  without  loss  of  gen'''ality,  expre.ss  them  in  terms  of  the  three  parameters  t\,  ti,  0  whose 
geometrical  interpretation  is  indicated  in  Figure  2.  A  routine  manipulation  then  yields  for  the  second 
moment  of  the  not-destroyed  value: 

(6)  E^-:  -  27r  ij.  0;  /.  w)g5'(f,.  tj.  «■  k.  d)  t.i^dedt^dt-,. 
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wherein 

“  Xo  Jo  l<{T)d\(T^  +  t}  —  Irti  cos0)'''^] 

+  /2  —  2t/2  cos(</>  —  9))'^^)  Td<t>dT, 

and  the  analogous  relation  holds  for  Q^Ux,  0,  f  w).  If  we  change  from  /],  ti,  0  to  x,  y.  p  by 
means  of  fj  =  [(x  +  p)^  +  I2  =  ((x  -  p)^  +  y^V^^,  0  =  arctan  [(x  +  p)/y]  -  arctan  ((x  -  pVyJ, 

the  Jacobian  is  Apl(t\t2)-  (The  geometric  interpretation  of  x,  y,  p  also  is  indicated  in  Figure  2.) 
Hence, 

(7)  E^2  =  Stt  /  ^  X  ^  Xo  (?2(->f.  y.  p;  /a-  ^aWi  (-t.  y.  p;  ^^)p  dpdxdy, 
wherein 

(8)  Q2(x,  y,  p,  k^,  d^)  =  \  I  ky,(u  -  X,  V  ~  y)d^(u  -  p,  v)d^{u  +  p,  \)dud'v, 

and  the  analogous  relation  holds  for  Q2(-t.  y-  P'^/a-  Considering  that  d^  =  \  -  d^,  (8)  may  be 
transformed  into 

(9)  Q2(.x.  y.  p;  k^,  d^)  =  I  -  Qx(i\\  k,  d)  -  QxUi,  k,  d)  +  Q2(x,  y,  p;  k^,  d^), 
wherein 

Q\(t\  k,  </)  =  Xg  Jq  k{r)dl(t^  +  r^  -  2lr  cos0)''^^ir  d(t>dr. 


o 


and  X.  y.  p 

3.  SECOND  MOMENT  IN  SPECIAL  CASES:  GAUSSIAN  AND  UNIFORM  FUNCTIONS 
Gaussian  Damage  and  Impact  Point  Probability  Density  Functions 

We  consider  the  circular-Gaussian  damage  function  v)  «  exp[—  (w^  +  v^)/(2<7-d)1,  with  the 

shape  parameter  a-g.  Furthermore,  we  assume  that  the  impact  point  p.d.f.  k^(-  ,  ■)  is  circular-normal, 
with  the  standard  deviation  <t.  It  follows  immediately  from  the  convolution  property  of  Gaussian  func¬ 
tions  that  Q\(t ,  k,  d)  —  a-pia-^  -F  o-p)”'  exp(—  -F  2crp)l.  The  function  Q2(x,  y,  p;  d^)  can 

also  be  written  integral -free.  A  routine  manipulation  results  in 

Qlix,  y,  p\ k^.  djx)  =  (27r<r^)“’exp(-pVa’p) 

f  f  exp [—  -F  v^)/(rplexp{-  [(u  -  x)^ -F  (v  -  y)^]/(2(T^)l  rfurfv. 

*/  — oe  •/  —00 
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We  recognize  the  double  integral  as  the  two-dimensional  convolution  of  two  Gaussian  functions,  so 
that  this  integral  can  be  expressed  as  a  Gaussian  function  of  x  and  y.  Consequently: 

(10)  Q2^x,  y,  f),  k4,  d^)  =  (T^da-^  +  +  y^)/(2(T^  +  cTp)]. 


Gaussian  Target  Value  and  Location  Probability  Density  Functions 


For  a  Gaussian  target  with  the  total  value  of  one  and  the  shape  parameter  <tz,  we  have 
w^iu,  v)  =  (2770-2)  'exp(-  (u^  +  v^)/(2o-2)1.  Let  the  p.d.f.  of  the  coordinates  of  its  center  be 
circular-normal,  with  the  standard  deviation  o-/-.  Then,  quite  similar  to  the  above  case, 
01  (t;  /  w)  =  [277(0-1  -f  o-^)]“'exp[- 77(20-2  +  2o-f)]  and 

(11)  y,  W4)  =  [477^1(20-2 -1- o-2)l~'exp[-pVo-2  -  (x^  r  y^)/(2(Tz  + 


Combining  these  value  density  and  location  p.d.f.’s  with  the  Gaussian  damage  functions  and  the  normal 
impact  point  p.d.f.  will,  for  the  expectation  of  the  not-destroyed  value,  yield 


(12) 


+  o-z)  '  X 


exp 


2(o-f  +  o-l) 


1 

'  ~ 


2(0-^  +  O-fl) 


f  dt. 


Observing  (9)  and  inserting  (10)  and  (11)  into  (7)  will,  for  the  second  moment  of  the  not-destroyed 
value,  yield  a  triple  integral  with  respect  to  x,  y,  p.  Then,  a  change  to  polar  coordinates  by  x  =  r  cosd>, 
y  =  r  sin<^  will  finally  result  in 


(13) 


£.1  = 


8 


Jk  00  ^ 


exp 


_  -£1  _ 


o-|  2o-f -4-0-1 


rp 


^  irl2 

pr  cos(t> 

Jo 

1  2  2 

4-  a-f) 

2(o-^  +  (Tp) 

cos  h 

cr^  4-  a-p 

<xd 


2(t^  4-  a-p 


exp 


-ei 

O-D 


2o-^  4-  O-n 


d<i>drdp. 


The  integrals  (12)  and  (13)  can  be  solved  in  closed  form,  by  developing  the  Mh  powers  and  integrating 
the  sums  term  by  term.  Practically,  this  is  of  little  interest,  since  the  complexity  of  the  integrands  is 
prohibitive  for  large  N.  (12)  and  (13),  however,  can  be  computed  easily  by  numerical  quadrature  for 
arbitrary  values  of  N. 


Coverage  of  Fixed  Circle  by  Randomly  Thrown  Circles 

In  this  section  we  apply  our  main  result  of  Section  2  to  generalizing  results  on  a  circularly  sym¬ 
metric  coverage  problem  considered  by  Garwood  [13]  and  Santalo  [27] .  These  authors  calculated  the 
second  moment  of  the  coverage  of  a  fixed  circle  (radius  Rt)  by  N  identical,  randomly  thrown  circles 
(radius  a)  when  the  centers  of  the  latter  are  distributed  uniformly  over  a  circle  with  radius  Rp.  The 
centers  of  the  two  circles  with  the  radii  R^  and  Rp  coincide. 


The  damage  function  dir),  in  this  case,  is  1  if  t  <  a,  and  0  else.  We  denote  by  k^iu,  v)  the 
uniform  p.d.f.  over  the  circle  with  the  radius  Rp.  Then  irRlQ^it,  k,  d)  is  the  intersecting  area 
S|(7;  Rp,  a)  of  the  two  circles  with  the  radii  a  and  Rp  whose  centers  are  the  distance  /  apart.  Further¬ 
more,  TTRpQiii],  t2,  k,  d)  is  the  intersecting  area  of  the  three  circles  with  the  radii  Rp,  a,  a,  and 
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the  centers  at  the  corners  O,  fli,  B2,  respectively,  of  the  triangle  indicated  in  Fig.  2.  We  denote  this 
intersecting  area  of  the  three  circles  mentioned  by  Sjf/i,  !2-  0)  and,  using  (9),  obtain 

(14)  02(^1.  h-  ^)  =  1  “  (7r/?o)''lS|(fi;  Rj),  a)  +  S^(t2\  Rd<  0)  ~  S2U\,  12,  fi'.  Rp- 

Next  we  consider  the  target  whose  value  density  is  one  within  a  circle  of  radius  Rr^  and  zero  without. 
We  assume  that  this  target  is  centered,  with  the  probability  one,  at  the  point  0(0,  0).  In  this  case 
S;  /,  w)  =  1  if  f,  <  Rr  and  I2  <  and  0  otherwise.  Hence  (6)  yields 

%  =  J/”  (?5'(r,,  t2.  fl;  k,  d)t,t2dndt,dt2. 

Combining  this  with  (14)  results  in  the  second  moment  for  that  proportion  of  the  fixed  circle  which  is 
not  covered  by  the  N  randomly  thrown  circles  of  radius  o: 

(15)  £^2  =  27r  (1  —  (ir/?o)“'[5'|(ri; /?o,  fl)  +  S|(/2; 

-  S2('i'  h-  a)Vi^ t\t2d(idi\dt2. 

This  is  the  generalization  to  arbitrary  relations  of  Rf,  Rp^  and  22,  of  results  of  Garwood  and  Santalo, 

who  considered  only  the  case  Rp  -  a  ^  Rf-  In  the  latter  case,  (15)  may  be  reduced  to  a  one-fold 

integral;  a  short  manipulation  results  in 

2R 

£,2=27r  f  ^  SAz-.Rt.  Rt)^\  -  (-irR^rHlna^  -  a.  aWzdz, 
which  is  an  alternative  form  of  Garwood’s  Equation  (43)  and  Santalo  s  Equation  (5.3), 

4.  NUMERICAL  RESULTS  FOR  GAUSSIAN  FUNCTIONS 

Equations  (12)  and  (13)  were  evaluated  by  numerical  integration,  and  the  standard  deviation  a-c 
of  the  destroyed  value  was  determined  by  means  of  the  relation  o-^  =  =  £^2  -  (£f  )^-  In  all  cases 

we  assumed  ctz  =  1.  Figure  3-6  show  in  the  Ff—o- (-plane  the  two  families  of  parameter  curves 
(Tr=  const,  and  N  =  const,  for  o-p  =  0.1  and  0.2,  cr  =  1  and  3.  Many  practically  important  cases  fall 
within  the  range  of  these  parameters. 

Discussion  of  Results 

The  figures  indicate  that,  for  (T;-  ^  K  the  standard  deviation  of  the  coverage  is  so  small  that  the 
one-sided  Tchebyshev  inequality  £(£(-  —  acre  ^  C)  ^  aVd  +  may  be  well  applied  to  deriving  a 
confidence  level  for  the  minimum  destroyed  value.  Take,  e.g.,  o-p  =  0.1,  o-  =  1,  o-r  =  0.5,  A  =  250 
rounds.  It  can  be  seen  in  Figure  3  that  £(-  s  57%,  cr(  =  5%,  so  that  the  coverage  is  with  a  probability 
of  at  least  90%  greater  than  42%.  For  o-r  »  1.  however,  Tchebyshev ’s  inequality  will  be  of  little 
value  for  our  problem. 

We  see  that,  for  increasing  a-j  and  A,  the  standard  deviation  of  the  coverage  o-(  approaches  its 
theoretical  upper  bound  on  the  circle  with  the  equation  cr^  =  £(  (1  —  £( ).  This  circle  is  drawn  as  a 
dashed  line.  This  property  implies  that  the  probability  mass  concentrates  more  and  more  near  C  =  0 
and  C  =  1. 

Numerical  Method 

The  integrals  (12)  and  (13)  were  evaluated  by  a  Gauss-Legendre  quadrature.  For  (13),  generally 
24^  supporting  points  were  used.  For  special  values  of  the  parameters,  32'  supporting  points  were 
chosen.  With  a  Fortran  program,  on  a  CDC  6500  computer  approximately  three  seconds  of  central  pro¬ 
cessor  time  was  required  to  determine  one  value  of  with  24'  supporting  points. 
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ABSTRACT 

This  paper  investigales  the  two-person  zero-sum  multiple  payoff  game  in 
which  the  objective  is  to  minimize  a  player’s  total  underachievement  from  a 
fixed  set  of  goals.  It  is  demonstrated  that  a  previous  formulation  of  this  prob¬ 
lem  can  be  substantially  simplified. 


I.  INTRODUCTION 

In  a  previous  paper  in  NRLQ,  Cook  [1]  discussed  the  formulation  and  solution  of  a  two-person, 
zero-sum  game  with  multiple  payoffs  in  which  the  objective  is  to  minimize  the  total  underachievement 
from  a  fixed  set  of  goals  for  each  of  the  objectives. 

The  resulting  formulation  is 
(1)  min  /3 

subject  to 

^  x,Aij(r)  -  /3  <  0  Vj,r;x  Ex 

i-i 

where  player  1  selects  strategy  ('with  probability  Aj, 

-f  =  . x„) 

^  =  <-^1 . x„) 

A(r)=  ^ 

k>=\ 

A^=  (tf*)  =  (g^-a!‘) 

K  is  the  number  of  objectives 
,/  is  the  index  for  player  2’s  strategies 

X*  is  the  goal  for  the  kth  objective 
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a*  is  the  payoff  for  the  /tth  objective  corresponding  to  player  1  choosing  strategy  /  and  player 
2  choosing  strategy  j 

A,j{r)  is  the  i/th  component  of  A  ir) 

a*  is  the  set  of  extreme  points  of  th 


<l>  = 


=  (a’ 


'')  0  <  a*  <  H-' 


r  =  1,  . . .  ,  is  the  index  on  the  extreme  points  of  i/(. 
w*'  =  the  relative  weight  placed  on  the  icth  goal. 

This  is  a  linear  programming  problem.  After  calculating  the  A(r)  matrices,  the  problem  involves 
the  minimization  of  a  linear  function  subject  to  —  1)  +  1  constraints  where  m  is  the  number  of 
strategies  available  to  player  1  and  K  is  the  number  of  multiple  payoffs.  Also,  the  calculation  of  the  2'^ 
A  (/•)  matrices  prior  to  solving  the  problem  can  be  quite  time-consuming. 

The  intent  of  this  paper  is  to  demonstrate  that  the  same  problem  can  be  solved  by  minimizing  a 
linear  function  subject  to  nfA  1)  +  1  constraints,  where  n  is  the  number  of  strategies  available  to 
player  2.  Also,  only  K  matrices  must  be  calculated  prior  to  the  solution  of  the  problem  rather  than  2^ 
matrices. 


2.  A  NEW  FORMULATION 

Letting  aj)  represent  the  ilrth  payoff  to  player  1  if  player  1  chooses  strategy  /  with  probability  x,, 

tn 

/  =  1,  ...  m,  and  player  2  chooses  strategy  y,  y  =  1 . //,  it  follows  that  ^  is  the  Mh  payoff  to 

/-I 

player  1  for  this  combination  of  strategies. 


Thus, 


dj  =  max 


0.,?* 


is  the  underachievement  of  goal  kig'^)  corresponding  to  this  strategy  combination. 


But  dj  —  max 


0,  £  (g*  -  o,5)x, 


since  is  constant  and  ^  x,  =  1. 


Since  each  of  these  </*  values  are  weighted  by  corresponding  w*  values,  player  1  selects  x, 


0,  £  X,  =  1 


which  will  yield 


K 

K 

max  y  . . 

i^\ 

where  K  is  the  number  of  multiple  payoffs. 
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Hence,  the  problem  can  be  expressed  as 

(2)  min  v 
subject  to 

A 

V  >  ^  w'^dj  Vj 

k~[ 

dj  =  max 

m 

I^;=  1 

/-I 

x/  ^  0  Vi 

where  v  is  the  value  of  the  game. 

However,  this  problem  may  be  expressed  as  a  linear  programming  problem. 

THEOREM  1:  Formulation  (1)  is  equivalent  to  the  linear  programming  problem 

(3)  min  v 
subject  to 

V  >  £  wV*  Vj 

fj  >  a,?)  Xi  Vj,  Vk 

/-I 

m 

1^,  =  1 
/-I 

T,  >  0  Vj,  f/  >  OVj,  Vk. 


£  {g'‘  -  a’-)x;.  0 


Vj,  Vk 


PROOF.  Suppose  v,  is  the  optimal  solution  to  (1)  and  V2  is  the  optimal  solution  to  (2).  We  want 
to  show  that  V)  =  V2.  Clearly,  V2  ^  V|  since  the  w*  are  positive  and/y'  ^  d*  for  all  j  and  k. 

K 

Thus,  it  must  be  shown  that  V2  <  vj.  Suppose  V2  >  Vj.  Then  V|  =  £  for  some  j,  say 

n-i 

j  =  y I,  and  some  set  of  x,'  values,  say  x/  =  x/^  V  i.  Letting  =  x/^,  for  all  ;  values  and  =  d,*  for  all 
k  values,  we  obtain  a  feasible  solution  for  problem  (2)  which  is  equal  to  V)  and  less  than  V2.  This  is  a 
contradiction  to  the  original  assumption. 

The  following  theorem  demonstrates  that  formulation  (3)  is  equivalent  to  Cook’s  formulation  (1). 


THEOREM  2:  Formulations  (1)  and  (3)  are  equivalent. 

PROOF:  Cook's  formulation  is 

min  max  £  w*d*(x,  y) 
y  k 


where  x  and  y  range  over  m  and  n  dimensional  unit  simplices,  and 
d*(x,  y)  =  max  |o,  ££  x*  (a*  -  atj^yA 
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Formulation  (3)  is 

min  max  V  w'‘dHx) 

X  j-1.2 . n  f-  ^ 

where 

d^ix)  =  max  0,  x,  (g*  -  a,}) 

I 

Clearly,  y)  <  J  djix)yj  for  all  k  for  any  x,  y. 


Also,  since  w*  >  0  for  all  k, 

^  w'^d'^  (x.y)  <  ^  X  **’'‘dj‘(x)  yj 

k  J  * 

and 

max  y  w'^d'^ix,  y)  <  max  V  V  w'^dfix)  y, 

y  k  y  j  k 


But,  if  is  the  extreme  point  of  they-simplex  whose  y-th  coordinate  is  1, 
r/*(jr,  ?■')  =  dj(x). 


Thus, 

max  y  w'^dHx)  =  max  y  (x,  e^). 

j  r  '  2-1.2 . xf 


Also, 

max  y  )v*(y*(x.  e2)  <  max  y  y) 

2  t*  2'  * 

since  the  sums  considered  on  the  left  are  a  subset  of  those  considered  on  the  right. 


Consequently, 

max  y  w*d''(x,  y)  =  max  T  H’*d*(x) 
y  k  J  k 

and  the  two  formulations  are  equivalent. 


3.  EXAMPLES 


EXAMPLE  1;  Consider  the  multigoal  game  solved  by  Cook  [Ij  with  three  different  payoffs  where 
gi  =  1,  g2  _  5  ^  4  corresponding  payoff  matrices  given  by 


-10  7 
3-4 


2  4 
-6-1 


and  weights  w'  -  2,  h'^  =  .5  and  =  1.  The  problem  may  be  formulated  as 
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min  V 


subject  to 


V  ^  2/1  +  .5/1  +  /f 

v  ^  2/2'  -F  .5/i  +  /I 

-Xj 

^  fl 

15x, 

+ 

2x2 

2x, 

-t- 

10X2 

<  y? 

7x, 

-F 

8x2 

-2x, 

-F 

9X2 

5X2 

<  fi 

i  =  1 

;-l 


X,  >  0  Vi,  >  0  Vj,  Vk. 

The  solution  to  this  problem  is  xj  =  1,  X2  =  0,  v  =  14,  ti  =  0  T2  “  '  a*  given  by  Cook.  Note 
that  in  this  formulation  only  three  matrices  (the  ((^*-a,*)),  k=  1,2,3)  need  be  calculated  in 

advance.  The  eight  extreme  points  a,,  /•  =  1 . 8  and  the  eight  matrices  A  (r),  r  —  1 . 8  are 

not  required.  Also,  there  are  9  constraints  instead  of  the  15  required  by  Cook. 

EXAMPLE  2:  In  this  example  provided  also  by  Cook,  the  three  goals  are  g'  =  4,  =  1,  g^  =  2 

with  corresponding  weights  w'  =  1,  =  2,  =  2.5.  The  payoff  matrices  are 


2  5  1 

-1  -2  6 

II 

-3  7  2 

0-2  0 

.  = 

8  -2  3 

-5  6  0 

0  3-1 

3-16 

-3  1  6 

Using  the  formulation  given  in  (2),  the  solution  to  this  problem  is  v  =  6.324S6,  X]  =  .49123. 
X2  =  .20175,  X3  =  .30702.  This  solution  differs  from  that  given  by  Cook  (v=  5.8148,  X|  =  .636024, 
X2  =  .157764,  X3  =  .20621 1),  but  this  is  the  result  of  some  errors  in  the  A  (r)  matrices  calculated  by 
Cook.  The  corrections  in  these  matrices  which  need  to  be  made  are  /l3|(2)  =  4,  /1 3,(4)  -10. 
/43i(5)  =  0,  A}f{6)  =  16.5,  Aj,ib)  =  -5,  />33(7)  =  4,  /l3j(8)  =  12.5,  and  /1 33(8)  =  9.  The  errors  were 
apparently  the  results  of  calculating  Un  (the  1-1  element  of  /4|)  to  be  -4  instead  of  4  and  calculating 
aji  to  be  4  instead  of  -4.  When  these  corrections  are  made.  Cook's  formulation  also  produces  the 
solution  given  above. 

The  formulation  in  (2)  has  13  constraints  and  the  previous  formulation  has  22  constraints. 


3.  SUMMARY 

The  formulation  of  the  two-person  zero-sum  multiple  payoff  game  given  in  (2)  is  an  improvement 
over  the  previous  formulation  in  that  the  number  of  constraints  increases  proiportionately  with  the 
number  of  strategies  available  to  player  2  and  proportionately  with  the  number  of  payoffs  as  opposed  to 
increasing  proportionately  with  the  number  of  strategies  available  to  player  1  and  exponentially  with  the 
number  of  payoffs.  In  general,  the  new  formulation  has  n(Af  -F  1)  -t-  1  constraints  instead  of 
-  I)  +  1  constraints. 
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For  example,  with  5  payoffs  present,  the  new  formulation  has  31,  25,  and  19  constiaints  when 
there  are  5,  4,  and  3  strategies,  respectively,  available  to  each  player.  In  the  previous  formulation  there 
are  156,  125,  and  ‘*4  constraints,  respectively. 

Finally,  it  should  be  noted  that  there  is  no  appropriate  formulation  corresponding  to  (2)  for  the 
payoff  to  player  2  since  the  formulation  leads  to  unbounded  values  for  the  /f  and  the  objective  function 
is  consequently  unbounded. 

ACKNOWLEDGMENT 

1  would  like  to  thank  an  anonymous  referee  for  demonstrating  the  equivalence  provided  in 
Theorem  2. 


REFERENCE 

[1]  Cook,  W.D.,  "Zero-Sum  Games  with  Multiple  Goals,"  Naval  Research  Logistics  Quarterly.  23. 
615-622  (1976). 


naval  research  Loots  l  ies  QUARTERLY 


VOL.  29,  NO.  1,  MARCH  1982 


ON  A  GENERALIZED  BIN-PACKING  PROBLEM 

R.  T.  Lewis 

R.C.A.  Corporation 
Princeton,  New  Jersey 

R.  G.  Parker 

School  of  Industrial  and  Systems  Engineering 
Georgia  Institute  of  Technology 
Atanta,  Georgia 

ABSTRACT 

In  this  paper  we  address  a  bin-packing  problem  which  possesses  a  variety  of 
modifications  of  the  classic  theme  Among  these  are  bin-dependent  chip 
weights,  bin  costs,  and  bin-dependent  penalties  for  unused  capacity.  Lagrangi- 
an  relaxations  are  employed  in  the  context  of  a  branch-and-bound  framework 
in  order  to  solve  the  problem  after  which  substantial  computational  experience 
is  provided. 


1.  INTRODUCTION 
1.1  Problem  Statement 

The  classic,  one-dimensional  bin  packing  problem  seeks  the  minimum  number  of  bins  (of  finite 
capacity)  required  to  pack  a  set  of  chips  possessing  given,  bin-independent  displacements  (weight, 
length,  volume,  etc  ).  In  this  paper,  we  consider  a  generalization  of  this  theme  which  can  be  made  pre¬ 
cise  by  model  (GBPPl)  below: 

m 

(1)  (GBPPl);  Minimize;  ^  (ciyj  +  d,u,) 

n 

(2)  Subject  to;  +  «,  =  Wjy,.  1  <  y  ^  m 

(-1 

(3)  £  x,,  =  1.  1  <  (  <  n 

(-1 

(4)  Xj),  y,  binary 

where;  is  a  strictly  positive,  integer  capacity  of  bin  j, 
til  is  the  deplacement  of  chip  i  if  assigned  to  bin  j, 

Uj  is  the  unused  capacity  of  bin  y, 

Cy  is  a  positive,  integer  cost  of  bin  y,  and 

di  is  a  positive,  integer  cost  of  unused  capacity  of  bin 
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The  binary  variables  x,,  and  y,  relate  to  the  assignment  of  chip  i  to  bin  J  and  the  use  of  bin  j,  respec¬ 
tively.  We  assume  the  problem  dimension  to  be  /i(chips)  by  m(bins)  where  n  and  m  are  such  that  at 
least  one  feasible  pack  is  possible.  By  "feasible  pack"  we  mean  any  solution  satisfying  (2)-(4).  Addi¬ 
tionally,  we  assume  the  unused  capacity  cost  (and  the  bin  cost)  to  be  realized  only  if  a  bin  is  used. 

While  not  considered  in  the  research  presented  here,  it  is  worth  remarking  that  it  may  be  of 
interest  to  constrain  (GBPPl)  further  by  assuming  the  existence  of  various  packing  restrictions.  To 
illustrate,  suppose  chips  represent  drums  of  chemicals  where  two  chemical  types  which  otherwise  could 
be  stored  together,  may  possess  properties  making  such  storage  undesirable.  Letting  R,  be  the  set  of 
(possibly  empty)  pairwise  packing  restrictions  for  bin  j,  we  could  append  (GBPPl)  by  constraints  of  the 
form 

(5)  <  1.  (k,  h)  e  Rj,  1  <  7  ^  m. 

Indeed,  constraints  of  this  nature  have  been  considered  in  other  work  by  the  first  author  (22);  however, 
in  what  follows,  we  have  assumed  R,  -  0  for  all  j. 

It  is  easy  to  see  that  the  classic  bin-packing  problem  is  but  a  special  case  of  (GBPPl).  In  particu¬ 
lar,  we  can  set  c,  =  1 ,  dj  =  0,  w,  =  h>  for  1  <  7  <  m  and  let  /„  =  f,  for  1  <  /  <  n,  1  <  7  ^  m. 
Equivalently,  letting  c,  =  0  and  «//  =  1  for  1  <  7  <  «  with  /,y,  and  w,  as  just  specified  achieves  the 
same  result. 


While  bin-dependent  chip  displacement  and  differing  bin  capacities  provide  rather  natural  generali¬ 
zations  of  the  classic  problem,  the  cost  penalty  for  unused  capacity  is  less  so.  In  addition,  this  cost 
penalty  for  unused  capacity  can  clearly  create  problems  where  a  solution  using  more  costly  bins  will 
have  a  smaller  total  cost  than  a  solution  where  less  costly  bins  are  used.  To  see  this  consider  the  fol¬ 
lowing  small  illustration: 


\ 

i 

5 

J 

1 

2 

3 

4 

5 

j- 

1  2 

3 

4 

, 

C| 

1/3c2 

00 

00 

00 
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C|  4/5c, 

3/5C| 

2/5c, 

l/5c, 

2 

l/3c, 

1  C2 

00 

00 

00 

d, 

d,  5/4d, 

5/3d, 

5/2d, 

5d, 

3 

1 

00 

1/3c2 

C3 

00 

00 

W,-. 

Cl  C2 

C3 

C4 

C5 

4 

00 

l/3r2 

00 

C4 

00 

5 

l/3r, 

1 

00 

00 

C5 

a  pack 

requiring  the 

fewest  bins  is  achieved  with  X21 

=  X5,  ^ 

“  X,2  =  X32 

X42-  i 

1  and 

elsewhere.  The  cost  or  value  of  this  solution  is  C|  -I-  C2  -f-  c^d^/3  =  — ^  +  c^d^/3.  Consider  now  the 

5  ]  g 

solution  x„  =  1  for  ail  i  =  7  and  =  0  elsewhere,  having  value  ^  Ci=  3ci.  If  0  ^  </i  <  —  the  first 


18 


/-I 


solution  costs  less  than  the  second.  Alternately,  if  dj  >  — ,  the  latter  solution  is  belter. 


A  central  ingredient  (although  not  the  only  one)  in  differentiating  between  least  cost  and  classic, 
least  cardinality  packs  is  the  assumed  penalty  or  charge  for  unused  bin  capacity  relative  to  bins  used. 
As  one  might  expect,  this  implicit  constraint  on  free  resource  disposal  creates  a  degree  of  difficulty 
which  empirically  speaking,  can  make  for  exceedingly  difficult  problems.  Note  that  we  use  the  "empiri¬ 
cal"  qualifier  here  to  distinguish  the  difficulty  of  (GBPPl)  from  that  of  the  classic  bin-packing  problem 
since  otherwise  the  notion  of  difficulty  or  complexity  is  indistinguishable  due  to  the  NP-completeness 
of  the  classic  problem. 
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It  is  interesting  to  imagine  physical  systems  where  (GBPPl)  applies.  One  example  arises  in  the 
case  of  so-called  parallel-processor  scheduling  ll,  31.  Here,  we  seek  a  schedule-creating  assignment  of 
single  operation  jobs  to  a  set  of  processors  which  minimizes  the  completion  time  (called  makespan)  of 
all  jobs.  With  a  modest  degree  of  effort  it  is  easy  to  see  that  this  is  but  a  dual-like  version  of  a  bin¬ 
packing  problem.  Specifying  some  limit  (e.g.,  lower  bound)  on  the  overall  completion  time,  we  then 
seek  an  m-bin  pack  of  chips  (jobs)  to  the  bins  (processors)  where  each  bin  has  capacity  no  greater  than 
this  specified  limit  on  processing  time  for  the  analogous  processor.  When  processors  are  nonidentical, 
the  bin-packing  interpretation  reflects  bin-dependent  chip  weights.  The  cost  of  bins  corresponds  to, 
say,  a  fixed,  processor  utilization  charge  (e.g.,  setup  cost)  and  assuming  that  once  a  processor  completes 
its  processing  assignment,  any  remaining  idle  time  is  costly  (e.g.,  opportunity  cost)  we  have  the  unused 
capacity  penalty  in  (GBPPl). 

Following,  we  present  a  brief  review  of  bin  packing  and  related  literature  after  which,  a  rather 
extensive  treatment  of  a  series  of  models  pertaining  to  (GBPPl)  is  included.  Here,  a  variety  of  formu¬ 
lations  are  examined  from  the  perspective  of  their  potential  usefulness.  The  most  promising  of  these  is 
explicitly  demonstrated  and  then  evaluated  by  a  set  of  computational  experiments. 

1.2  Review  of  Relevant  Literature 

Presently,  much  of  the  literature  deals  only  with  the  traditional  bin-packing  problem.  Further, 
there  appears  to  be  less  than  complete  standardization  in  the  open  literature  regarding  the  terms  bin¬ 
packing  problem,  cutting-stock  problem,  loading  problem,  and  multiple-knapsack  problem.  Occasion¬ 
ally,  the  terms  are  used  interchangeably,  and  often  they  are  not.  In  order  to  avoid  any  confusion,  the 
terms  bin-packing  or  cutting-stock  will  be  taken  by  convention  here  to  mean  the  problem  where  all 
objects  must  be  packed  or  cut  and  the  objective  is  to  minimize  the  number  of  bins  or  pieces  of  stock 
used.  The  terms  loadtng  or  multiple  knapsack  will  be  taken  to  refer  to  the  related  problem  where  not 
all  the  objects  must  be  loaded  or  assigned  and  the  objective  is  to  maximize  the  value  of  the  loaded 
items. 

One  of  the  earliest  appearances  of  the  classic  bin-packing  problem  in  the  literature  is  in  the  paper 
of  Eilon  and  Christofides  [6].  While  several  formulations  of  the  bin-packing  problem  as  well  as  related 
problems  are  discussed  in  l6l ,  the  one  actually  solved  is  as  follows: 


m  m 


(6) 

Minimize: 

f-|  >-l 

n 

(7) 

Subject  to: 

^  r,x,,  <  w.l  <  y  <  m 
1 

m 

(8) 

Xii  -  I.I  <  /  ^  n 

/“  1 

(9) 

Xij  binary 

where  v,,.,  =  pv,,  v,  =  1,  and  p  is  the  largest  number  of  objects  that  can  be  packed  into  a  bin.  This 
formulation  is  interesting  because  it  eliminates  the  need  for  y,  variables  to  indicate  the  use  of  a  bin 
(each  with  capacity  w)  due  to  the  structuring  of  the  v,  coefficients  which  ensures  that  the  smallest 
number  of  bins  is  used.  Devised  in  [61  is  an  exact  and  a  heuristic  algorithm  for  solving  the  problem. 
The  exact  procedure  is  a  variation  of  the  tree-search  algorithm  of  Balas  [2]  while  the  heuristic  algorithm 
is  a  variation  of  the  best-fit  decreasing  algorithm  which  will  be  discussed  shortly. 
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Performance  guarantees  on  the  quality  of  the  solution  for  bin-packing  heuristics  have  received 
substantial  attention.  In  1972  and  1973  Garey,  Graham  and  Ullman  [101,  presented  worst-case  results 
of  various  heuristic  algorithms.  However,  it  was  Johnson's  doctoral  thesis  in  1973  (19)  which  laid  the 
theoretical  groundwork  for  the  worst-case  analysis  of  the  class  of  heuristic  algorithms  known  as  any-fit. 
A  summary  of  this  work  appears  in  [201.  Also,  Johnson,  Demers,  Ullman,  Garey  and  Graham,  have 
presented  a  comprehensive  survey  of  both  old  and  new  results  in  |21) 

The  four  any-fit  algorithms  of  primary  interest  are  as  follows; 

Let  0  =  (1,  2 . nl  be  the  set  of  objects  or  chips,  and  let 

y  =  (l.  2 . /n)  be  the  set  of  bins. 

1.  First-Fit  (FF):  Assign  each  object  from  set  0  in  the  order  in  which  it  appears  in  the  set  to 
the  bin  in  set  7  with  the  lowest  index  into  which  it  will  feasibly  fit. 

2.  Best-Fit  (BF):  Assign  each  object  from  set  0  in  the  order  in  which  it  appears  in  the  set  to 
the  bin  in  set  /  such  that  the  resulting  unused  capacity  is  minimal. 

3.  First-Fit  Decreasing  (FFD);  Order  the  set  0  in  nonincreasing  order  on  the  basis  of  object 
displacement.  Apply  the  FF  algorithm. 

4.  Best-Fit  Decreasing  (BFD):  Order  the  set  0  in  nonincreasing  order  on  the  basis  of  object 
displacement.  Apply  the  BF  algorithm. 

The  worst-case  results  are  expressed  as  a  ratio  of  the  solution^  value  of  the  heuristic  to  the  optimal 
solution  for  each  algorithm.  The  ratios  for  these  algorithms,  say  /?,  are  given  as  asymptotic  results  and 
appear  as  follows  (21 1; 

1.  lim  Rff  (k)  “  17/10 

k—oo 

2.  lim  =  17/10 

k  — oo 

3.  lim  Rf/rpik)  =  1 1/9 

k  — <» 

4.  \im  Rgfoik)  =  \\/9 

k~^oa 

Here,  k  is  the  optimum  number  of  bins  required  for  a  given  list  of  chip  displacements. 

More  recently  Hung  and  Brown  in  1978  (16]  considered  a  form  of  the  bin-packing  problem  in 
which  the  bins  were  permitted  to  have  differing  capacities.  Their  formulation  follows: 


m 


(10) 

Minimize: 

Lyr 

f-l 

(11) 

Subject  to: 

n 

Z  i,x„  <  w,y^ 

l•\ 

1  <  y  <  m 

(12) 

m 

L  ^9  =  1 

/-I 

1  ^  n 

(13) 

X,,,  yi  binary. 
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The  algorithm  developed  is  basically  one  of  implicit  enumeration.  However,  a  characterization  of 
assignments  was  given  which  capitalizes  on  any  equivalence  in  object  displacements  or  bin  capacities. 
This  permitted  a  development  of  a  set  of  rules  which  excludes  redundant  assignments. 

In  1975  Ingargiola  and  Korsh  [181  presented  an  enumerative  algorithm  for  solving  the  related 
loading  or  multiple-knapsack  problem.  The  formulation  of  the  problem  considered  appears  as: 


n  m 


(14) 

Minimize: 

Z  Z 

1=1  /-I 

(15) 

Subject  to: 

n 

Z  ^  *^/ 

1 

1  <  7  ^  m 

(16) 

m 

1  X„  ^  1 
/-I 

1  <  /  <  n 

(17) 

Xif,  binary. 

The  algorithm  in  [181  is  based  on  the  notion  that  before  a  search  is  attempted,  as  many  decisions  as 
possible  should  be  made  about  the  inclusion  and  exclusion  of  objects  from  the  knapsacks.  This  is 
accomplished  by  the  introduction  of  an  ordering  relation  among  objects  which  generates  those  objects 
that  will  be  excluded  in  an  optimal  solution  if  a  given  object  is  included. 

In  1978,  Hung  and  Fisk  [17]  presented  a  branch-and-bound  algorithm  for  solving  the  same  prob¬ 
lem  as  that  of  Ingargiola  and  Korsh.  Here,  it  was  shown  that  it  is  possible  to  compute  the  optimal  LP 
dual  multipliers  in  closed  form.  Furthermore,  these  multipliers  for  the  first  set  of  constraints  were 
shown  to  equal  a  constant.  Two  relaxations  of  the  problem  were  developed;  lagrangian  and  surrogate. 
In  the  lagrangian  relaxation  the  second  set  of  constraints  are  relaxed  by  use  of  the  optimal  LP  dual  mul¬ 
tipliers,  and  the  problem  decomposes  into  separate  knapsack  problems  for  each  bin.  In  the  surrogate 
relaxation  the  first  set  of  constraints  is  combined  into  a  single  constraint,  and  the  second  set  of  con¬ 
straints  is  substituted  into  this  constraint;  the  problem  then  reduces  to  a  single-knapsack  problem. 

In  a  follow-up  article  Fisk  and  Hung  [91  presented  a  heuristic  algorithm  for  solving  the  same 
problem.  In  fact,  they  used  the  same  surrogate  relaxation  as  discussed  above.  Their  procedure  is 
founded  upon  the  observation  that  if  a  feasible  solution  can  be  found  among  the  objects  and  bins 
identified  in  the  optimal  solution  to  the  relaxed  problem,  then  an  optimal  solution  to  the  original  prob¬ 
lem  has  been  found.  The  search  for  a  feasible  solution  employs  a  set  of  exchange  rules  which  considers 
all  possible  one-for-one,  one-for-two  and  two-for-one  exchanges  of  objects  between  bins. 

Of  additional  interest  is  work  relating  to  what  has  been  popularized  as  "generalized  assignment 
models."  Included  are  a  myriad  of  formulations  which  deviate  in  various  ways  from  the  traditional 
assignment  theme.  Subsumed  by  these  generalizations,  of  course,  is  the  classic  bin-packing  problem. 
A  lucid  exposition  of  an  algorithmic  developments  within  this  area  can  be  found  in  [24]  and  [25]. 

Other  references  pertaining  to  bin-packing  and  related  problems  can  also  be  cited.  A  partial  list 
would  include  works  in  [4,  5,  7,  11,  27). 

2.  METHODOLOGICAL  DEVELOPMENTS 

There  are  a  variety  of  ways  to  approach  (GBPPl),  classic  among  which  is  a  branch-and-bound 
scheme  where  bounds  are  derived  from  an  appropriately  constructed  linear  programming  relaxation.  In 
fact,  such  an  approach  was  pursued  by  the  authors  where  (GBPPl)  is  augmented  with  clique  constraints 
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which  tighien  the  relaxation.  These  clique  constraints,  derived  from  implicit  as  well  as  explicit  packing 
restrictions,  can  be  quite  powerful  in  characterizing  the  integer  hull  (e  g.,  123])  of  (GBPPI)  but  can 
themselves  be  relatively  difficult  to  construct.  This  should  be  obvious  since  determining  cliques  in 
graphs  is  NP-complele  [12].  To  this  extent,  the  linear  programming  relaxations  which  were  developed 
did  not  result  in  especially  encouraging  computational  performance  (especially  when  /f,  ^  As  a 
consequence,  we  turn  to  an  alternative  attack,  also  rather  classic,  where  we  create  lagrangian  relaxations 
of  the  problem  [26].  In  what  follows,  we  consider  various  of  these  relaxations  presenting 
computational  results  for  the  more  promising  ones. 

Suppose  we  recast  (GBPPI)  in  the  form  given  by  (GBPP2)  below  where  we  let  u,  =  H,y,  - 

/I 

^  2,  =  I  —  y,  and  where,  in  addition,  we  replace  it,  in  the  first  set  of  constraints  by  a  slack  vari- 

I- 1 

able.  Restating  the  problem  as  a  maximization  problem,  we  have: 


m  m  m  n 


(18) 

(GBPP2):  ^  (c,  +  (J,  w,,  —  Maximize: 

£(c,  +  +  Y.  ^ 

,~i  i-\  -1 

(19) 

Subject  to: 

L  './•’f./  +  ^  w,,  1  $  /  ^  m 

/-I 

(20) 

m 

X  A,,  =  1 ,  \  ^  i  ^  n 

i-\ 

(21) 

x,i.  binary. 

In  the  ensuing  developments,  we  will  adapt  the  notational  convention  whereby  q,  denotes  the  lagrange 
multiplier  associated  with  the  packing  constraints  and  r,,  the  multiplier  associated  with  the  assignment 
restrictions. 

2.1  l.agrangian  Relaxation  of  Packing  Constraints 

If  the  packing  restrictions  of  (GBPP2)  are  relaxed,  the  lagrangian  formulation  for  a  given  nonne¬ 
gative  vector  <7  can  be  stated  by  (LRU  below: 


mm  ri 


(22)  (LRl): 

^  (c,  -E  d,  w,)  —  ^  —  Maximum: 

/-I  i-i 

/“  I 

ni  »i 

+  Z  Z  ~  \ 

i-l  —1 

(23) 

Subject  to: 

^  .v„  =  1 ,  1  <  /  ^ 

t-  1 

(24) 

X,,.  binary. 

Now,  relaxing  the  packing  constraints  of  (GBPP2)  leads  to  a  relaxation  for  which  the  optimal 
solution  yields  a  bound  value  identical  to  that  obtained  from  the  solution  of  the  LP  relaxation  of 
(GBPP2).  This  result  follows  directly  from  the  integrality  property  given  in  Geoffrion  [13]  which  states 
that  an  optimal  value  of  a  lagrangian  relaxation  is  not  altered  by  dropping  the  integrality  conditions  on 
its  variables.  The  implication  of  this  property  is  fairly  immediate  and  can  be  given  by  the  following: 
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THEOREM  1.  Let  the  LP  relaxation  be  feasible  and  let  the  lagrangian  relaxation  have  the 
integrality  property.  Then  the  maximum  value  of  the  lagrangian  relaxation  is  equal  to  the  value  of  the 
LP  relaxation. 

PROOF;  The  theorem  follows  trivially  from  the  integrality  property  specified  in  [13]. 

Clearly,  (LRl)  possesses  the  integrality  property  and  by  Theorem  1  its  maximum  value  equals  the 
value  of  the  LP  relaxation  of  (GBPP2).  As  a  consequence,  little  may  be  gained  in  formulating  and 
solving  (LRl)  that  is  not  otherwise  achievable  by  simply  solving  the  LP  relaxation  of  (GBPP2).  For 
this  reason,  the  model  relaxing  the  packing  constraints  is  not  pursued  further. 

2.2  Lagrangian  Relaxation  of  the  Assignment  Constraints 

Suppose  we  pursue  an  alternative  lagrangian  formulation  based  upon  a  relaxation  of  the  con¬ 
straints  pertaining  to  assignment.  As  it  turns  out,  this  relaxation  will  prove  to  be  the  most  fruitful  in 
this  work  and,  in  fact,  provides  the  basis  of  the  principle  algorithm  developed  and  subsequently  tested. 

If  the  assignment  constraints  are  relaxed,  the  lagrangian  relaxation  of  (GBPP2)  for  a  given  vector 
rcan  be  given  by  (LR2)  below: 

(25)  (LR2):  (c,  +  d,  w,)  -  X  ^  ~  Maximum:  [c,  +  d,Wj)zi  + 

/- 1  /  /-I 

1-1  -I 
n 

(26)  Subject  to:  ^  i„x,;  +  w^z,  <  1  ^  y  ^  m 

.-1 

(27)  x,i,  Zj  binary. 

To  maximize  the  value  of  this  relaxation,  we  want  to  find  those  multipliers  r,  which  solve  the  following 
lagrangian  dual  problem: 

n  m  m  n  I 

(28)  (LDl):  Minimize:  X  maximum:  ^  (c,  +  djWj)Zj  +  ^  ^  (d/t,y  -  | 

1-1  y-l  j-\  1-1 

n 

>-l 

x„,  z,  binary 

(29)  Subject  to:  r,  unrestricted,  I  ^  ^ 

Consider  the  maximization  part  of  (LDl).  The  following  result  defines  an  optimality  condition 
that  must  hold  for  this  maximization  problem: 

LEMMA  I;  Let  z*,  x*  be  the  optimal  solution  to  (LDl)  for  a  given  r  Then 

(30)  (c,  -f-  d,Wi)z*  -E  5^  (d,t„  -  r,)x*  ^  (c,  +  1  ^  7  <  m. 
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PROOF:  By  contradiction  suppose  that  at  optimality  (30)  is  violated  for  some  bin  j.  This  implies 
that  -*=  0  and  that  x*  =  1  for  one  or  more  objects.  However,  the  value  of  this  solution  can  always  be 
increased  by  letting  2*=  1  x*  -  0  for  all  /  which  contradicts  the  assumption  of  optimality  and  the  proof 
is  complete. 

By  use  of  Lemma  1,  (LDl)  can  be  transformed  into  the  following  problem: 

(31)  (LD2):  Minimize:  X  X 

I- 1  i-i  /-I 

(32)  Subject  to:  <7,  ^  £  («f,/„  —  r,,  -  (c,  +  d,Wi) .  1  ^  /c  <  p, 

1  ^  j  ^  m 

(33)  Qi  Q,  1  <  y  ^  m;  r,  unrestricted,  1  ^  <  n. 

where  /,*  is  the  set  of  objects  in  the  Aah  feasible  pack  for  bin  j  and  p,  is  the  number  of  feasible  packs 
for  bin  j. 

To  see  the  relationship  creating  (LD2)  note  that  the  objective  function  of  the  maximization  part 
of  (LDl)  must  contain  a  constant  term,  Cf  +  for  each  bin  j.  The  value  of  the  objective  function  in 
excess  of  this  constant  is  ^  (rf,/„  —  r,)  -  (c,  +  t/,H'^)  since  the  excess  can  only  occur  when  objects  are 

packed  into  bin  j  and  not  when  bin  j  is  closed.  Furthermore,  the  objective  function  of  (LDl)  requires 
that  the  value  in  excess  of  this  constant  be  maximized.  If  we  let  q,  represent  the  maximum  value  then 
Qj  ^  X  ~  ~  ^  0  fo*'  3*1  ^  which  is  the  constraint  set  and  the  sign  restriction  for 

m 

q,  in  (LD2).  Finally,  since  the  objective  function  of  (LDl)  requires  that  q,  be  minimized,  we  have 

/-I 

the  objective  function  of  (LD2). 


Note  that  the  objective  function  of  (LD2)  contains  a  constant  term  which  cancels  exactly  the  con¬ 
stant  term  in  the  objective  function  of  (LR2).  Since  (LD2)  is  a  transformation  of  (LDl),  the  lagran- 
gian  dual  problem  for  (LR2)  can  be  written  as  follows: 


n  m 


(34)  (LD3): 

Minimize: 

£  ^  +  £  Q,- 

.-1  /-I 

(35) 

Subject  to: 

£  r,  +  q,^ 

1  ^ 

3^ 

/A 

'ii,k 

1  ^ 

./'  ^  m 

(36) 

-  <7,  ^  0.  1  ^  ./ 

<  m,  r,  unrestricted. 

1  ^ 

i  ^  n 

Writing  the  LP  dual  of  this  problem  we  have  (LD4): 


(37)  (LD4): 

Minimize: 

£ 

+  d,W, 

-  £ 

d,ii,  ' 

/k 

'‘V 

(38) 

Subject  to: 

£  V 

,k  -  1. 

1  ^  / 

<  n 

/*€/, 

(39) 

£  ^,k 

^  1  , 

1  < ./ 

^  m 

k 
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(40)  V,*  >  0  .  \  <  k  p,,  1  ^  y  <  m  . 

where  is  the  dual  multiplier  associated  with  each  constraint  in  (LD3),  the  objective  function  has 
been  changed  from  a  maximization  to  a  minimization,  and  /,  =  10,  k)\i  €  /y*l. 

Observe  that  due  to  the  constraints,  each  in  (LD4)  is  further  restricted  to  be  upper-bounded 
by  1.  If  each  v,*  is  thought  of  as  a  variable  which  selects  a  pack  /,4  or  some  portion  of  it,  then  (LD4) 
is  the  problem  of  selecting  a  set  of  packs  (possibly  fractional)  such  that  the  cost  of  the  packs  is  minim¬ 
ized.  each  object  is  packed  once,  and  at  most  one  pack  is  selected  from  each  bin.  The  cost  of  a  pack  is 
the  cost  of  a  bin,  c,  plus  the  cost  of  any  unused  capacity  in  bin  J,  djw,  — 


Problem  (LD4)  can  be  further  constrained  by  the  addition  of  a  lower  bound  on  the  number  of 
packs  that  must  be  selected.  With  the  addition  of  this  constraint,  (LD4)  can  be  rewritten  as  follows; 


(41)  (LDS): 

Minimize: 

I 

C, 

+ 

-  L  ‘^I'ii 

'^ik 

ik 

iil,k 

(42) 

Subject  to: 

I 

V, 

,*=  1. 

\  4  i  4  n 

(43) 

I' 

<  1. 

1  4  j  4  m 

k 


(44)  ^  v,n  ^  /  and  v,*  ^  0, 

Ik 

Here,  /  is  a  lower  bound  on  the  number  of  bins  required 


1  ^ 

I  ^  J  4  m. 


Relaxing  the  assignment  constraints  leads  to  a  lagrangian  relaxation  for  which  determining  the 
optimal  value  of  the  relaxation  is  equivalent  to  solving  a  large  LP  problem.  This  notion,  with  the 
exception  of  the  constraint  on  the  minimum  number  of  bins,  follows  from  the  results  of  Fisher, 
Northup,  and  Shapiro  181.  The  discrete  optimization  problem  which  is  considered  takes  the  following 
form: 


(45)  Minimize:  /'(^) 

Subject  to:  g(x)  ^  b 

X  ^  X 

where  /(x)  is  a  scalar  valued  function,  g(x)  is  a  function  from  R"  to  R",  and  A"  is  a  set  of  discrete  ele¬ 
ments.  Fisher  showed  that  the  lagrangian  dual  of  this  problem,  where  the  constraints  are  relaxed,  can 
be  transformed  into  the  following  problem: 

7 

(46)  Minimize: 

1-1 

1 

Subject  to:  ^ 

/-I 

k ,  ^  0  for  all  I 

where  x'  is  an  element  in  the  set  X and  T  —  iJfl. 
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The  equivalence  between  (LD4)  and  the  above  LP  is  easy  to  verify: 


I.  The  index  k  is  equivalent  to  i.  When  the  assignment  constraints  are  relaxed,  (GBPP2) 
decomposes  into  separate  problems  for  each  bin.  Thus,  there  is  a  separate  index  k  for 
each  bin  j. 


2. 

3. 

4. 


Xj  —  {/,i.  Iji.  ....  Ifk 

Iji,  is  equivalent  to  x'. 

\cj  +  djWj  -  ^  djti\ 


1,  the  set  of  feasible  packs  for  bin  j,  is  equivalent  to  X. 
That  is,  /,*  is  an  element  of  X,. 
is  equivalent  to  f(x'). 


5.  X  ''jk  equivalent  to  g(x')  and  is  equivalent  to  b. 

6.  The  convexification  constraint  in  (LD4)  can  also  be  stated  as  an  equality  if  one  includes 
the  empty  pack  where  y,  =  0  as  a  feasible  pack.  The  variable  representing  the  empty  pack 
is  the  slack  variable. 


7.  v,^  is  equivalent  to  X 


Let  us  return  to  (LD5).  If  one  were  to  enumerate  all  feasible  packs  for  every  bin  and  solve  the 
resulting  LP  problem,  the  amount  of  computational  effort  would  be  large  indeed.  However,  it  may  not 
be  necessary  to  consider  all  possible  packs  or  for  that  matter  even  a  large  number  of  them.  What  are  of 
interest  are  the  "goods"  packs;  that  is,  those  which  will  be  in  an  optimal  solution  to  (LD5).  Required 
then  is  a  procedure  for  generating  these  "goods"  packs.  Such  a  procedure  can  be  stated  as  follows: 


A  Column  Generation  Procedure 


Step  (0):  Generate  a  slack  variable  for  each  bin  and  add  it  to  the  problem.  Determine  a  value 
for  /,  the  minimum  number  of  bins  required.  If  a  feasible  solution  to  (GBPP2)  is 
known,  generate  the  packs  which  correspond  to  this  solution  and  add  these  columns 
to  the  problem. 

Step  (1):  Solve  (LD5)  yielding  a  set  of  optimal  dual  multipliers.  Let  r*  correspond  to  the  first 
set  of  constraints  in  (LD5),  q\  the  second  set,  and  s\  the  third  set. 


(47) 

(48) 


Step  (2);  For  each  bin  j,  solve  the  following  knapsack  problem: 

m 

Maximize:  ^  (</,/„  +  /••)jf„ 

/-I 

m 

Subject  to:  ^  r„x„  <  w,. 

/-I 


Let  jf*  be  the  optimal  solution  for  bin  /. 

Step  (3):  For  each  bin  j,  check  if  the  column  generated  is  an  improving  column.  That  is,  if 
(49)  X  rpc*  +  <?;  +  s*  >  c,  +  d,w,  -  d,t„x' 


then  add  the  column  corresponding  to  x*  to  the  problem. 
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Step  (4)'.  If  at  least  one  of  the  generated  columns  is  an  improving  column,  go  to  Step  1.  Otherwise, 
the  current  solution  to  (LD5)  is  optimal.  If  the  current  solution  to  (LD5)  contains  artificial 
variables,  then  there  exists  no  feasible  solution  to  (GBPP2). 

The  framework  of  the  procedure  above  is,  of  course,  standard  and  has  enjoyed  frequent  use  else¬ 
where.  In  particular,  the  use  of  a  column  generation  scheme  was  employed  in  [14,  15l  for  the  solution 
of  the  classic  cutting-stock  problem. 

2.3  The  Oeveloiiinent  of  the  Branch -and-Bound  Procedure 

In  this  section,  we  suggest  a  branch-and-bound  procedure  which  is  used  in  resolving  any 
differences  between  the  solution  to  (LD5)  and  the  solution  to  (GBPP2).  Since  the  LP  problem  in 
(LDS)  results  from  a  langrangian  relaxation  of  the  assignment  constraints,  it  is  possible  that  the  value 
of  the  optimal  solution  to  (LDS)  may  be  less  than  the  value  of  the  optimal  solution  to  (GBPP2). 
Furthermore,  even  if  the  value  of  the  optimal  solution  to  (LDS)  equals  that  of  the  optimal  solution  to 
(GBPP2),  the  solution  to  (LDS)  may  be  fractional. 

Bounds  in  the  branch-and-bound  procedure  are  determined  by  solving  a  relaxed  problem  with  the 
optimal  dual  multipliers  from  the  solution  to  (LDS).  The  problem  which  is  relaxed  is  as  follows: 


m  m  n 


(50) 

(GBPP3):  Minimize: 

L  (c,  +  d/*V;)yy  - 
/-I 

/-I  i-I 

(51) 

Subject  to: 

n 

L  ^  %3',  - 

1  <  y  <  m 

(52) 

m 

L  Xr,  =  1 

/-I 

1  <  /  <  rt 

(53) 

Xij.  y,  binary. 

u 

Problem  (GBPP3)  follows  from  (GBPPl)  by  letting  u,  ■=  in  the  objective  function  and 

I- 1 

by  replacing  u,  with  a  slack  variable  in  the  first  set  of  constraints.  The  lagrangian  relaxation  of  the 
assignment  constraints  of  (GBPP3)  for  a  given  r  result  in  (LR3)  below: 


(S4)  (LR3):  ''i  Minimize: 

i-i 


£  (C;  +  d^w,)y^ 


/-! 


I 


I 


/-I 


(djt,,  +  r,)x,^ 


(SS) 


n 

Subject  to:  Y.  ^  1  ^  ./  m 

(-1 


(S6) 


Xii,  yj  binary. 


In  this  form,  (LR3)  contains  no  constraint  on  the  number  of  bins  that  are  required  to  solve  the 

m 

problem.  The  relaxation  could  be  augmented,  for  example,  with  a  constraint  of  the  form  X  3*/  ^  ^ 

y-i 

where  A:  is  a  lower  bound  on  the  number  of  bins  required.  However,  this  constraint  makes  no  distinc- 
tion  between  the  capacity  of  the  bins.  A  constraint  of  the  following  form  Y,  ^  more 

/-I 

discriminating.  Here  t  is  a  lower  bound  on  the  bin  capacity  needed  to  solve  the  problem.  If  the  dis¬ 
placement  of  objects  were  independent  of  the  bins  in  which  the  objects  are  packed,  then  r  would  simply 
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n 

be  ^  t,.  However,  the  displacement  of  objects  can  vary  from  bin  to  bin;  thus,  t  must  be  determined  as 

/-I 

n 

y  min  (/,,),  since  there  may  exist  a  solution  where  each  object  attains  its  minimum  displacement. 

However,  depending  on  the  range  of  object  displacements,  this  capacity  constraint  may  be  rather  weak. 
A  capacity  constraint  which  seems  to  more  accurately  reflect  the  capacities  of  the  bins  can  be  given  as 

m 

Oiyi  >  n  where  a,  is  the  maximum  number  of  objects  that  can  be  packed  into  bin  j.  Clearly  this 

/-I 

constraint  is  valid  since  any  feasible  solution  must  allow  for  n  objects  to  be  packed. 

The  value  of  5^  for  each  bin  j  can  be  determined  by  solving  the  following  knapsack  problem 


(KPl): 

(57) 

(KPl):  Maximize: 

n 

/-I 

(58) 

Subject  to: 

n 

X,  <  ‘‘'z 
1-1 

(59) 

x,i  binary. 

While  this  problem  can  be  treated  with  any  standard  knapsack  algorithm,  it  can  also  be  solved  by  the 
following  procedure: 


KPl  Solution 


Step  (1):  For  bin  j  sort  the  objects  in  nondecreasing  order  on  the  basis  of  f„.  Let  (i)  denote 
the  i  th  element  in  the  sort. 


p 

Step  (2):  Let  p  be  the  greates*  integer  (0  ^  p  ^  n)  such  that  ^  f(,),  < 

/-I 


Step  (3):  The  optimal  solution  is  X(,),  =  1  for  »  =  1,  2 . p  and  Xf,)^  =  0  for  /  —  p  +  1, 

p  +  2 . n  if  p  >  0  or  x„  =  0  for  ail  /  if  p  =  0. 


LEMMA  2:  The  KPl  solution  procedure  solves  problem  (KPl)  optimally. 


PROOF:  Let  p*  be  the  value  of  p  determined  in  Step  2  of  the  procedure.  If  p*  =  0,  then  the 
solution  is  trivially  optimal.  If  p*  >  0  then  it  is  possible  to  pack  at  least  p*  chips  into  bin  ,/ as  shown  in 

Step  3.  Let  the  chips  packed  be  denoted  by  (1).(2) . (p*).  The  chips  not  packed  are  then 

(p*  +  1),  (p*  +  2),  ....  (n)  such  that  i(,,,  >  u,  for  /  =  p*  +  1,  p*  +  2 . n  where 

~  ^  'ill/  In  order  for  a  solution  to  (KPl)  to  exist  in  which  it  is  possible  to  pack  more  than  p* 

I- 1 

chips  into  bin  /,  it  must  be  possible  to  exchange  at  least  q  +  1  chips  not  in  the  current  pack  for  q  chips 
which  are  in  the  pack.  However,  the  total  displacement  of  any  q  chips  not  packed  must  be  at  least 
equal  to  the  displacement  of  any  q  which  are  packed.  Further,  the  displacement  of  any  size  chip  not 
packed  must  exceed  Thus,  it  is  not  possible  to  create  an  exchange  of  q  +  1  unpacked  chips  for  q 
which  are  packed  into  bin  j.  Hence,  p*  is  the  maximum  number  of  chips  which  can  be  packed  and  the 
proof  is  complete. 


When  augmented  with  the  capacity  constraint,  (LR3)  can  be  rewritten  in  the  form  given  by 
(LR4): 
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n  m  m  n 


(60)  (LR4): 

5^  n  +  Minimize:  ^  ”  L  L 

/-I  i-\  j-\  1-1 

(61) 

n 

Subject  to:  ^  1  <  7  <  w 

/-I 

(62) 

m 

Z  «/>'/  ^ " 

(-1 

(63) 

x„.  y,  binary. 

The  solution  to 

(LR4)  can  be  obtained  as  follows: 

LR4  Solution  Procedure 

Step  (1) 

For  a  given  r,  solve  the  following  knapsack  problem  for  each 

(64) 

m 

Maximize:  ^  Ojtj,  +  r,)x,i 
/-I 

(65) 

n 

Subject  to:^  ri,x„  ^  w, 

im  1 

(66) 

Xj,  binary. 

Let  x*  be  the  optimal  solution  to  this  problem  for  bin  j  and  let  v,  ^  c,  +  diWi  - 

I- 1 

(d,r„  +  Ox’. 


Step  (2): 


For  each  bin  J  solve  KPl.  Let  xj  be  the  optimal  solution  to  (KPl)  and  let 

5,  =  Z  </• 

I 


(67) 

(68) 

(69) 


Step  (3);  Solve  the  following  knapsack  problem: 

m 

Minimize:  v,>'/ 

;-l 

m 

Subject  to:  ^  a,>'/  >  n 
1-1 

y,  €  (0.  1). 


Let  y’  be  the  optimal  solution  to  this  problem  and  let  the  value  of  the  lagrangean  relaxation 

L-ir,  +  t  vX- 

,-i  /-I 

The  correctness  of  this  procedure  is  given  by  the  following  result: 

THEOREM  3:  The  LR4  solution  procedure  solves  problem  (LR4). 
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PROOF;  In  an  optimal  solution  to  (LR4)  either  y,  =  0  or  y,  =  1  for  each  ,/.  If  y,  =  0  for  some  7, 
then  X,,  =  0  for  all  i  and  the  value  of  the  lagrangian  relaxation  associated  with  bin  7  is  zero  (i.e., 
V,  =  0).  However,  if  y,  =  1,  then  the  optimal  value  of  the  lagrangian  relaxation  associated  with  bin  7  is 
the  optimal  value  of  the  following  problem: 

(70) 

Minimize: 

n 

U'i  +  d,Wn  -  Y,  +  r,)x,i 

(71) 

Subject  to: 

n 

/-I 

(72) 

X„  €  {0,  ll. 

This,  however,  is  the  same  problem  as  that  which  is  solved  in  Step  (1)  of  the  (LR4)  solution  pro¬ 
cedure.  Thus,  V,  is  the  optimal  value  of  the  lagrangian  relaxation  asociated  with  bin  j  and  the  solution 
of  (LR4)  reduces  to  the  solution  of  the  same  problem  which  is  solved  in  Step  (3)  of  the  procedure. 

The  branch-and-bound  procedure  to  be  used  in  resolving  differences  between  the  optimal  solution 
to  (LD5)  and  the  optimal  solution  to  (GBPP2)  can  now  be  stated  formally. 

Solution  Procedure  for  (GBPP2) 

Step  (0):  Solve  (LD5):  if  the  optimal  solution  is  integer,  stop.  Let  the  subproblem  list  be 
empty,  and  let  p  =  0.  Let  the  candidate  bound  C*  be  a  large  positive  value.  Per¬ 
form  Steps  1  through  4  and  go  to  Step  5. 

Step  (1):  Using  the  optimal  lagrangian  multipliers  from  Step  0.  Solve  (LR4)  to  yield  a  current 
bound  C  and  a  current  solution  (x,  >»). 

If  r  >  C*go  to  Step  4. 

Step  (2):  If  for  the  current  solution 
^  x„  1  for  any  i 

liJ 

where  J  =  I7  ly,  =  1 1 
go  to  Step  3. 

Update  the  candidate  bound  and  solution  (x*,  y*)  with  the  current  bound  and  solution 
and  remove  all  subproblems  from  the  subproblem  list  with  a  bound  equal  to  or  greater 
than  the  candidate  bound.  Go  to  Step  4. 


Step  (3); 


Let  /|  -  : 

^  \\ 

/2  = 

['■  1 L  =  0 

1  1 

1 

If  f|  =-  ^  go  to  Step  3.1 
Let  J,  =  \j\j  ^  J.  i  i  and  x„  -  1} 

Select  for  branching  the  variable  x*,  such  that 


Hli  max  Imax  !(</,/„  +  r’)/!,,]]. 

/  €  /  J  /  f  y, 


Add  the  subproblem  to  the  subproblem  list  and  go  to  Step  4. 
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3.1  Let  y,'=  {j\j  €  yand  object  ;  €  /j  is  not  prohibited  from  bin  y).  If  y/=  9  for  all  /  €  li  go 
to  Step  3.2.  Select  for  branching  the  variable  jc*,  such  that 

e*,  =  max  max  Krfjf,,  +  r,*)//„ll 
'^^2  /« y 

Add  the  subproblem  to  the  subproblem  list  and  to  to  Step  4. 

3.2  Let  J,"—  [j\j  ^  yand  object  /  E  /j  is  not  prohibited  from  bin  y).  If  J,"=  9(ot  any  /,  a  feasi¬ 
ble  solution  does  not  exist;  go  to  Step  4.  Select  for  branching  the  variable  xi^,  such  that 

0  L,  =  min  min  (v./a,)). 
itl^  it  J,' 

Add  the  subproblem  to  the  subproblem  list  and  to  to  Step  4. 


Step  (4):  Set  /2  —  p  -t-  1. 

Step  (5):  Select  from  the  subproblem  list  the  subproblem  with  the  least  lower  bound.  If  the 
subproblem  list  is  empty,  stop. 

Step  (6):  Let  Jr*/  =  0  and  perform  Step  1  through  4.  Let  x*,  =  1  and  y,  =  1  and  perform  Step 
1  through  4.  Go  to  Step  5. 


2.4  An  Approximate  Langrangian  Relaxation 

Recall,  from  the  branch-and-bound  procedure  for  (GBPP2)  just  presented,  that  the  optimal  multi¬ 
pliers  are  first  determined  by  column  generation  procedure.  The  branch-and-bound  scheme  is  then 
invoked  to  determine  the  optimal  primal  solution.  We  could,  of  course,  have  employed  a  subgradient 
approach  to  determine  these  multipliers,  indeed  such  an  attack  was  examined;  however,  the  usual 
computational  difficulties  arose  (e.g.,  step  size  adjustment,  etc.)  and  the  procedure  was  abandoned  after 
a  preliminary  investigation.  Regardless,  with  either  the  column  generation  or  the  subgradient  procedure 
no  further  adjustment  of  the  lagrange  multipliers  is  pursued  after  the  initial  optimization  due  to,  in  fact, 
the  computational  expense  incurred.  Suppose,  however,  it  were  possible  to  approximate  the  values  of 
the  optimal  multipliers  in  a  manner  that  was  not  computationally  expensive.  In  such  a  case,  it  might  be 
possible  to  recompute  multipliers  at  each  node  in  the  branch-and-bound  tree  without  sustaining  prohibi¬ 
tive  computational  effort.  Following,  we  develop  precisely  such  an  approach. 

Clearly,  one  method  for  approximating  the  multiplier  values  would  be  to  simply  use  the  optimal 
values  of  the  dual  multipliers  from  the  LP  relaxation.  It  is  the  case  however,  that  in  those  instances 
where  the  cost  of  unused  capacity  dominates  or  affects  significantly  the  optimal  solution  value,  the 
bounds  derived  by  using  the  optimal  LP  dual  multipliers  for  an  approximation  can  be  quite  weak.  We 
must  have  another  alternative  and  as  such,  consider  again  (LR4).  In  order  to  maximize  the  value  of 
this  relaxation,  we  seek  those  multipliers  r  which  solve  the  lagrangian  dual,  (LD6)  below; 


(73)  (LD6):  Maximize:  X  '’i  {  Minimum  ^  (c^  +  diWj)yi  — 


/-I 


/-)  ;-l 
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Subject  to;  r,  unrestricted 


i-l 

n 

Z  ^,y,  >  " 

/-I 


x„,  binary 


It  is  well  known  that  the  objective  function  of  this  problem  is  concave  in  the  r,  variables  [9], 

Suppose  now  that  all  the  lagrange  multipliers  are  constrained  to  be  equal  in  value.  Then  the  prob¬ 
lem  can  be  restated  as  follows. 


(74)  (LD7);  Maximize:  Z ''i  {Minimum  Z  “ 


1-1 


/-I 


Z  Z  +  Oxij 

/-I  (-1 


Subject  to:  =  /■  and  unrestricted  1  <  7  < 


for  1  ^  f  <  /I 


1-1 


Z  “/>’/  >  ” 

;-l 


Xij.  yj  binary 


or  equivalently, 


(75) 


Maximize;  nr  +  |  Minimum  ^  (c/  +  djWj)yi  — 

/-I 

Z  Z 

/-I  <-l 


Subject  to;  r  unrestricted 


Z  '>i^»  ^  **’/>'/■  1  ^  7  < 


Z  «/>'/  >  " 

;-l 


x„.  binary 


A  constraint  set  for  the  r,  variables  has  been  introduced  into  the  problem.  However,  these  constraints 
are  all  convex  and  so,  the  lagrangian  dual  now  pertains  to  the  maximization  of  a  concave  function  over 
a  convex  set.  Let  L*be  the  optimal  value  of  the  lagrangian  relaxation  without  the  constraints  on  r,  and 
*,  the  optimal  value  with  the  constraints  on  r,.  Since  the  latter  problem  is  constrained  and  the  former 
IS  not,  then  it  must  be  that  L*  <  L*. 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


VOL.  29,  NO.  1,  MARCH  1982 


GENERALIZED  BIN  PACKING  PROBLEM 


135 


The  restrictions  on  the  values  of  the  r,  variables  will  only  be  worthwhile  if  (LD7)  is  relatively  easy 
to  solve.  In  order  to  show  how  to  solve  this  problem,  we  first  need  to  establish  some  properties  for 
(LD7).  The  concavity  of  the  objective  function  in  r  has  already  been  established.  The  following  pro¬ 
perty  of  the  objective  function  also  holds: 

PROPERTY  1:  The  objective  function  of  (LD7)  is  piecewise  linear  in  r. 


PROOF:  This  property  follows  from  the  minimization  portion  of  the  objective  function  of  (LD7). 
Let  r  have  the  value  r'.  Then  there  exists  an  optimal  solution  to  the  minimization  part  of  (LD7).  Call 
this  solution  (x',  ^')  and  let  r'  change  by  an  amount  A.  For  A  small  enough,  the  optimal  solution  to 
the  minimization  problem  will  remain  unchanged.  The  change  in  r'will  cause  the  objective  function  of 

(  m  n  1 


(LD7)  to  change  by  an  amount 


«  -  Z  51  < 


A  which  is  linear  in  A.  If  A  is  made  large  enough. 


/-I  /'i 


then  the  optimal  solution  to  the  minimization  problem  will  change  and  the  objective  function  will  be 
defined  by  a  new  line. 


PROPERTY  2:  The  slope  of  the  ojective  function  of  (LD7)  at  any  point  /-'is 

m  n 

"  -  51  Z 

/-I  I- 1 

where  (x',  y')  is  the  optimal  solution  to  the  minimization  problem  in  (LD7)  at  r  =  r'. 


PROOF:  This  result  follows  directly  from  Property  1. 

The  plot  of  L  versus  r  for  a  typical  problem  mightjippear  as  shown  in  Figure  1.  The  line  seg¬ 
ments  are  numbered  1  through  7,  and  the  optimal  value  L*  occurs  at  the  intersection  of  line  segments 
3  and  4.  Since  the  objective  function  of  (LD7)  is  piecewise  linear  and  concave  in  r,  the  optimal  value 
will  always  occur  at  either  the  intersection  of  two  line  segments  or  anywhere  along  a  line  segment 
whose  slope  is  zero.  This  suggests  the  following  procedure  for  finding  the  value  of  r  which  optimally 
solves  (LD7).  We  note  that  other  alternatives  exist  for  the  solution  of  (LD7). 


L 
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Solution  of  (LD7) 

Step  (0):  Select  an  initial  value  for  €  >  0.  Let  m  +  =  r  =  0. 

Step  (1);  Solve  (LR4)  with  r,=  r  fo£  all  i.  Let  m  be  the  slope  of  the  objective  function  of 
(LD7)  at  r,  and  let  /  be  its  L  intercept.  If  m  =  0  stop.  If  m  >  0,  go  to  Step  3. 

Step  (2);  Let  =  m,  (_  =  /,  and  r.  =  r.  If  m ^  0,  go  to  Step  4.  Let  r  =  r  —  X  where 
X  >  0  is  a  step  size.  Go  to  Step  1. 

Step  (3);  Let  m+  =  w,  i+  =  /,  and  =  r.  If  m_  ^  0,  go  to  Step  4.  Let  r  =  r  +  X  where 
X  >  0  is  a  step  size.  Go  to  Step  1. 

Step  (4);  Let  r'  be  the  point  at  which  the  two  lines  defined  by  /+  and  m_,  /_  intersect.  If 
Ir  -  T-'l  <  «  stop.  Let  r  =  r\  and  go  to  Step  1. 

We  can  give  the  following  theorem  regarding  the  solution  determined  by  the  algorithm: 

THEOREM  4;  The  LD7  solution  procedure  terminates  with  a  value  of  r  which  is  at  most  a  dis¬ 
tance  €  >  0  from  r*  which  optimally  solves  (LD7). 

PROOF:  Clearly  the  procedure  terminates.  Either  a  line  segment  with  a  slope  of  zero  is  found; 
or  since  X  >  0,  eventually  a  point  is  found  on  one  side  of  r*  and  a  point  r_  is  found  on  the  other 
side.  The  interval  (r+  -  r_)  is  successively  reduced  in  magnitude  to  value  of  «  or  less.  If  the  pro¬ 
cedure  terminates  in  Step  1,  then  a  line  segment  for  the  objective  function  has  been  found  which  has  a 
slope  of  zero  at  r.  Any  point  on  this  line  segment  will  yield  an  optimal  solution  to  (LD7).  If  the  pro¬ 
cedure  terminates  in  Step  4,  then  r  is  either  on  the  line  defined  by  m  +  ,  /+  or  m_,  Without  loss  of 
generality  assume  that  r  lies  on  the  line  segment  defined  by  m+,  /+.  Then  at  the  point  r  the  objective 
function  of  (LD7)  slopes  downward  in  the  direction  of  negative  r,  and  at  the  point  r’  which  is  at  most  a 
distance  «  from  r  the  objective  function  of  (LD7)  slopes  downward  in  the  direction  of  positive  r.  Thus 
the  value  of  rat  termination  is  at  most  a  distance  c  from  r*  which  optimally  solves  (LD7). 

Before  this  procedure  can  be  implemented,  it  is  first  necessary  to  devise  some  means  of  determin¬ 
ing  the  step  size  X .  If  r  is  negative,  then  the  sign  of  the  terms  (<7,r„  -E  r)  can  be  used  to  gauge  the  step 
size.  For  example,  if  the  slope  is  negative,  then  there  are  too  many  objects  assigned  and  the  number 
must  be  reduced.  Lei  X  be  determined  as  follows: 

(76)  X  =  min  Kd,/,/  +  r)|(d,i„  +  r)  >  0). 

IJ 

Then  X  will  be  the  minimum  decrease  in  r  such  that  at  least  one  Jf„  coefficient  will  no  longer  be  posi¬ 
tive.  If  one  of  these  objects  was  previously  assigned,  then  it  would  no  longer  be  attractive  to  assign  it 
and  the  number  of  assigned  objects  would  decrease.  Conversely,  if  the  slope  is  positive,  then  there  are 
too  few  objects  assigned  and  the  number  must  be  increased.  Let  X  be  determined  as  follows: 

(77)  X  =  min  (— (<f,f„  +  /■)|(<f,r„  -E  r)  <  0). 

»./ 

Then  X  will  be  the  minimum  increase  in  r  such  that  at  least  one  x„  coefficient  will  no  longer  be  nega¬ 
tive.  Now  at  least  one  of  these  objects  can  be  assigned  and  the  number  of  assigned  objects  can 
increase. 

If  r  is  positive,  then  the  sign  of  the  terms  -E  r)  can  no  longer  be  used  to  gauge  the  step  size 
since  the  sign  will  always  be  positive.  Instead,  if  the  slope  is  positive,  let  X  be  determined  as  follows: 

(78)  X  =  min  \djid,  -  a,)\(d,  -  af  ^  0) 
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where:  =  the  maximum  number  of  objects  that  can  be  assigned  to  bin  ./ 

a,  =  the  current  number  of  objects  that  are  assigned  to  bin  ./. 

Since  there  are  too  few  objects  assigned  and  the  number  must  be  increased,  K  is  the  minimum  increase 
in  r  such  that  the  number  of  objects  assigned  will  equal  the  maximum  number  that  can  be  assigned  for 
some  bin.  To  see  this,  let  v,  =  i,  —  air  and  v,  =  /,  —  aj  where  v,  is  the  value  of  the  current  solution 
for  bin  j,  /y  is  its  value  when  r  =  0.  v,  is  the  value  of  the  solution  for  bin  ,/  when  the  maximum 
number  of  objects  are  assigned  to  bin  7,_and  /,  is  its  value  when  r  =  0.  These  two  solutions  have  the 
same  value  (i.e.,  Vy=Vy)  when  r=  (ij  —  ~  aj).  If  these  two  solutions  are  different  (i.e., 

a,  Uy),  then  /y  and  (y  must  differ  by  at  least  the  cost  of  one  unit  of  unused  capacity  (i.e.,  /,  =  /,  +  d,). 
Thus,  r  must  increase  by  at  least  d,/(dy  —  a,). 

If  r  is  positive  and  the  slope  is  negative,  let  X.  be  determined  by  setting  K  =  r.  Since  there  are  too 
many  objects  assigned,  the  number  must  be  reduced.  However,  it  does  not  seem  possible  to  determine 
a  minimum  decrease  in  r  such  that  the  number  of  assigned  objects  will  decrease.  Setting  r  equal  to 
zero,  which  is  what  this  step  size  does,  will  certainly  not  increase  the  number  of  assigned  objects  and 
will  probably  decrease  the  number  substantially. 


With  the  addition  of  the  step  size  calculations,  the  procedure  for  solving  (LD7)  can  be  restated  as 
follows: 

Updated  Solution  Procedure  for  (LD7) 

Step  (0):  Select  an  initial  value  fore  >  0.  Let  r  —  m  +  =  m_  =  0. 

Step  (1):  Solve  LR4  with  r,  =  r  for _all  i.  Let  m  be  the  slope  of  the  objective  function  of 

(LD7)  at  r,  and  let  /  be  its  L  intercept.  If  m  =  0,  stop.  If  r  >  0,  go  to  Step  3. 

Step  (2):  If  m  >  0,  go  to  Step  2.1.  Let  =  m  and  >-  =  /.  \f  0,  go  to  Step  2.2.  Let 

A  =  min  {(d,ryy  +  r):  (dit,i  +  r)  >  0).  Let  r  =  r  -  A,  and  go  to  Step  1. 

2.1  Let  =  m  and  (+ =  /.  If  m_  ^  0,  go  to  Step  2.2.  Let  A  =  min  +  r): 

l.l 

{ditii  +  r)  <  0).  If  |/.  7l(d,l„  +  r)  <  0)  =  0,  let  A  =  —  r.  Let  r  =  r  +  A,  and  go  to  Step  1. 

2.2  Let  r'  be  the  point  at  which  the  two  lines  defined  by  m  +  ,  /+  and  m,,  /_  intersect.  If 
Ir  —  r'l  <  «,  stop.  Let  r  =  r',  and  go  to  Step  1. 

Step  (3):  If  m  >  0,  go  to  Step  3.1.  Let  m_  =  m  and  i-  =  i.  If  ^  0,  go  to  Step  3.2.  Let 
A  =  r  Let  r  =  r  —  A ,  and  go  to  Step  1 . 

3.1  Let  m+=  m  and  i+ =  i.  If  ^  0,  go  to  Step  3.2.  Let  A  =  min  (d,/(a,  —  a,)l(a,  ~ 

I 

a,)  ^  0).  Let  r  =  r  +  A,  and  go  to  Step  1. 

3.2  Let  r'  be  the  point  at  which  the  two  lines  defined  by  w  +  ,  /+  and  W-,  intersect.  If 
|r  -  r'l  <  *,  stop.  Let  r  =*  r’,  and  go  to  Step  1. 

The  updated  (LD7)  solution  procedure  was  embedded  in  a  branch-and-bound  procedure  similar  to 
that  provided  earlier  for  (GBPP2).  In  fact,  the  only  difference  here  is  in  the  composition  of  Steps  0 
and  1.  We  present  the  following  specification  accordingly: 
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Algorithm  for  (GBPP2)  with  (LD7)  Solution  Incorporated 

Step  (0);  Let  the  subproblem  list  be  empty,  and  let  p  =  0.  Let  the  candidate  bound  C*  be  a 
large  positive  value.  Perform  Steps  I  through  4,  and  go  to  Step  5. 

Step  (1):  Solve  (LD7)  to  yield  a  current  bound  C  and  a  current  solution  (x,  ^).  If  C  >  C*, 
go  to  Step  4. 

Steps  (2),  (3),  (4),  (5),  and  (6)  are  the  same  as  for  the  previous  (GBPP2)  algorithm. 

3.  SAMPLE  PROBLEM 

In  this  section  we  present  a  small  example  problem  in  order  to  demonstrate  the  use  of  the  approx¬ 
imate  lagrangian  formulation  for  (GBPP2).  In  particular,  we  present  in  step-by-step  manner  the  appli¬ 
cation  of  the  algorithm  which  concluded  the  previous  section.  Consider  the  problem  given  by  the  table 
below: 


Sample  Problem:  //  =  5,  m  =  A, 


We  would  proceed  as  follows: 

Step  (0):  C*  =  °°,  p  =  0. 


Step  (1); 

0, 

II 

II 

II 

II 

rst 

II 

II 

r  =  -2, 

C  =  6, 

>'2=  3'4=  1- 

X|4  =  X44=  X54=  1 

X32  =  1  ,  X44  =  1 

r=  -5/3,  C=  6. 

>'2  =  >'4  =  L 

-<^12  =  -^32  “  1 

Step  (2): 

Step  (3). 

Go  to  Step  3. 

/,  ”  [ll.  /2=  (2) 

Ji  =  (2.  4) 

X,4  =  X44  =  Xs4  =  1 

0i4=  max  {1/6,  1/3)  =  1/3.  Add  the  subproblem  with 
r  ”  -5/3,  C  =  6,  and  X|4  as  the  branching  variable  to 
the  subproblem  list. 


Step  (4) :  p  -  1 . 

Step  (5):  Select  the  subproblem  with  r  =  -5/3,  C  =  6,  and  X|4 
as  the  branching  variable  from  the  subproblem  list. 

Step  (6):  X|4  -  0. 
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Step  (1):  r  =  —5/3,  C  =  19/3,  >’2  =  >'4  =  1.  A|2  =  xyi  —  1 

X44  =  Xs4  =  1 

r  =  0,  C=7,  j'2=>4=l-  x,2  =  X22  =  X42  =  1 

X44  “  X'54  =  1 . 

Step  (2):  Go  To  Step  3. 

Step  (3):  /,  =  {4l.  /2=  (31.  A=  (2.  4l, 

044  =  max  {1 ,  2l  =  2. 

Add  the  subproblem  with  /•  =  0,  C  =  7,  and  X44  as  the 
branching  variable  to  the  subproblem  list. 


Step  (4). 

p  =  2 

X,4=  l,y4 

=  1. 

Step  (1): 

> 

il 

1 

C=  6,  y2=  >-4=  1. 

X22  —  X32  —  1 

X)4  =  X44  =  X54  =  1 

Step  (2); 

C*=  6, 

II 

II 

II 

II 

#^4 

Xf4  =  XJ4  =  X5%  =  1 . 

Remove  the  subproblem  with  /■  =  0,  C  =  7,  and  X44  as  the 
branching  variable  from  the  subproblem  list. 

Step  (4):  p  =  3. 


Step  (S):  Stop. 
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Schematically,  the  solution  procedure  can  be  summarized  by  the  following  tree: 


Clearly,  we  have  that  chips  2  and  3  are  packed  into  bin  2  at  a  total  cost  of  4  (bin  cost  of  3  plus  unused 
penalty  of  1)  while  chips  1,  4  and  5  are  assigned  to  bin  4  at  a  cost  of  2  (bin  cost  only,  since  capacity  is 
used  entirely)  yielding  the  overall  solution  cost  of  6. 


4.  COMPUTATIONAL  RESULTS 

In  this  section  we  present  a  summary  of  what  was,  in  total,  a  rather  substantive  computational 
testing  activity  in  this  research.  More  precisely,  we  shall  present  empirical  results  relative  only  to  the 
behavior  of  the  approximate  lagrangian  models  on  randomly  generated  test  problems.  We  remark, 
however,  that  additional  tests  were  made  for  both  the  column  generation  and  subgradient  models  in 
which  the  results  were  almost  uniformly  poorer  than  when  the  approximate  lagrangian  approach  was 
examined.  Hence,  the  stated  concentration  in  this  section  follows  accordingly. 
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To  the  extent  that  we  present  results  for  the  case  of  only  the  approximate  lagrangian  models  the 
following  delineation  must  be  made  in  order  to  properly  interpret  the  ensuing  tabular  summaries. 
Recall  that  in  the  model  given  by  (LD6)  the  dual  multipliers  specified  by  the  vector  r  were  free  in 
terms  of  sign  distinction.  Interestingly,  if  we  restrict  the  sign  of  these  multipliers  such  that  r  <  0, 
there  is  little  effect  in  certain  problem  ranges,  on  overall  computational  effort.  However,  the  mean 
number  of  dual  iterations  increases,  thus  indicating  that  the  average  price  per  dual  iteration  is  less  with 
sign  restriction  imposed  than  without  such  restriction.  The  following  tables  are  organized  around  this 
differentiation. 

In  all  tables,  each  of  the  parameters  describing  (GBPPl)  are  varied  as  dictated  in  the  respective 
tables.  In  addition,  for  certain  sizes,  problems  which  were  both  correlated  and  uncorrelated  (relative  to 
f,,)  were  generated  and,  in  all  experiments,  two  replicated  at  each  level  of  n  and  m  were  performed. 
For  ease  of  presentation,  we  have  coded  the  ranges  of  parameter  generation  as  follows; 


c,~U  (8.  10) 

(a) 

c,-U  (40.  50) 

_  (b) 

d,~  U  (\.  2) 

(c) 

d,~  U  (5.  10) 

(d) 

i„  ~  U  (4,  6) 

(e) 

~  t/  (1,  9) 

_  (f) 

p  =  0.0 

_  (g> 

P  =  0.5 

_  (h) 

Note  that  the  space  for  ranges  on  the  parameter  w,  is  given  directly  in  the  tables  since  these  ranges 
were  generted  in  such  a  way  as  to  preserve  a  constant  average  number  of  bins  used  in  any  solution.  All 
computational  experience  reported  was  performed  on  a  CYBER  74  computer. 

In  Table  I,  a  summary  of  the  performance  of  the  approximate  lagrangian  approach  is  presented 
for  both  the  case  of  a  restriction  on  the  sign  of  the  multiplier  vector  r  and  the  case  where  r  is  not  res¬ 
tricted.  Calling  these  problems  set  /,  we  find  that  for  cases  ^  =  10.  m  =  5,  7,  9  and  n  =  15, 
m  =  5,  1 ,  9  the  procedure  averaged  less  effort  per  problem  where  no  restriction  or  r  was  imposed.  For 
the  remaining  levels  of  n  and  m  (in  Table  1)  the  opposite  outcome  occurred.  Observing  that  problems 
at  the  larger  values  of  n  dealt  only  with  the  "random"  case  of  p  =  0.0,  there  is  obviously  some  cause  for 
suspicion  regarding  the  effect  on  the  algorithm  relative  to  "harder”  versus  "easier"  problems.  To  lend 
some  insight  into  this  potential  dichotomy  in  performance,  a  second  problem  set  was  run  for  those 
problems  in  Table  1  where  the  parametric  range  was  fixed  to  coincide  with  those  (the  exception  is  with 
capacity)  for  the  case  n  =  25  and  m  =  5.  Note  in  Table  2,  the  cases  of  n  =  25,  m  =  5  and  n  =  20, 
m  =  1  are  excluded  since  clearly,  they  would  appear  as  in  Table  1. 

In  observing  the  average  computational  effort  required  per  problem  solved  in  problem  Set  II, 
there  does  appear  to  indeed  be  at  least  empirical  evidence  that  the  restricted  case  out-performs  the  case 
when  r  is  subject  to  no  sign  restriction.  Of  course,  if  sufficient  interest  existed  for  making  more  solid 
any  claims  regarding  the  performances  of  both  cases,  a  substantially  greater  experimental  design  than 
that  used  in  this  work  would  be  in  order.  Regardless,  to  gain  some  insight  into  computational  perfor¬ 
mance  on  larger  problems  we  have  applied  only  the  restricted  case  to  a  third  problem  set.  The  results 
appear  in  Table  3. 

An  overall  examination  of  the  computational  results  reveals  that  sizeable  effort  can  be  required  in 
a  variety  of  instances.  At  the  very  least,  there  is  substantial  disparity  in  minimum  and  maximum 
requirements.  Clearly,  subtle  parametric  changes  in  problems  otherwise  identical  can  create  vastly 
different  computational  demands. 
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TABLE  1  —  Approximate  Lagrangian  Procedure  —  Results  for  Problem  Set  / 


-  -  - 1 

Problem 

Size 

n  m 

Problems 

Attempted 

Solved* 

Parameuic 

Ranges 

Bin  Capacity. 

“V 

Timet 

(Seconds) 

Average 

Number 

of  Nodes 

Average  Number 
of  Dual 
Iterations 

10 

- 1 

5 

— 

64 

a,b,c,d,c. 

w,  ~  ty(9.  19) 

restricted 

2.780 

(0.082,  26  998) 

91  531 

101.625 

1 

64 

f.g.h 

H,  -  6  (20.  30) 

unrestricted 

_ 

1  915 

(0.079.  16  802) 

34.281 

53.484 

10 

7 

64 

a,b,c,d,e. 

~  6(9,  19) 

restricted 

4.515 

(0.058,  24.228) 

97.375 

1 10.094 

64 

f.g.h 

w,  -  V(20.  30) 

unrestricted 

3  740 

(0  124,  17,463) 

52.250 

_ 1 

75  563 

10 

9 

64 

a,b,c,d,e. 

w,  -  t/(9,  19) 

restricted 

8.580 

(0  155,  82  342) 

174.656 

184.750 

64 

f.g.h 

»■  -  t/(20.  30) 

unrestricted 

5.975 

(0  422,  39.819) 

68.156 

98.000 

1 

15 

5 

r  ~ 

64 

a,b,c,d,e, 

w,  ~  t/(16.  26) 

restricted 

14.721 

(0  529,  130  665) 

174.107 

213,750  1 

56 

f.g.h 

w,  ~  U(ii.  43) 

unrestricted 

12,707 

(0,887,  118  463) 

66,071 

109  107  1 

_ 

15 

7 

32 

a.b.c.d.e. 

»,  ~  6  (16.  26) 

restricted 

12  917 

(0.402.  95  639) 

157.452 

190.097 

31 

f.g 

H,  --  (7(33.  43) 

unrestricted 

12  503 

(0  839.  61  382) 

78.806 

128  742 

15 

9 

32 

a,b,c.d,e. 

K,  ~  i/(l6,  26) 

restricted 

25,434 

(1,040,  95,220) 

327.516 

349.613 

31 

f.g 

H,  ~  0  03  .  43) 

unrestricted 

20.590 

(0  569,  85.658) 

87.516 

159.097 

20 

5 

32 

•^b.c.d.e. 

w,  ~  {7(24,  34) 

restricted 

22.775 

(2.182.  93  355) 

129.000 

170.013 

24 

f.g 

K,  ~  {7(45.  55) 

unrestricted 

44  268 

(5.571,  139.323) 

151.167 

_ 

276.583 

20 

7 

16 

a.b.c.d.e. 

w,  -  {7(45.  55) 

restricted 

23  833 

(1  476,  85  .540) 

135  875 

282  438 

16 

f.g 

unrestricted 

41.482 

(5.616,  121.813) 

112  500 

186  438 

25 

5 

16 

a,b,c,d,e. 

K,  ~  {7(58,  68) 

restricted 

14.372 

(2.386,  59  413) 

53.222 

81. Ill 

9 

f.g 

unrestricted 

35.381 

(15.762.  84  891) 

86.111 

186  778 

*A  maximum  of  1000  nodes  in  the  branch-and-bound  tree  or  200  seconds  of  execution  time  (whichever  was  reached  first)  were 
used  as  cutoffs 

t  Average  (minimum,  maximum) 
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TABLE  2  —  Approximate  Lagrangian  Procedure  —  Results  for  Problem  Set  II 


Problem 

Size 

n  m 

Problems 

Attempted 

Solved 

Parametric 

Ranges 

Bin  Capacity, 

% 

- 1 

r 

Time 

(Seconds) 

Average 
Number 
of  Nodes 

Average  Number 
of  Dual 

Iterations 

10  5 

16 

16 

a,b,c,d,e, 

Tg 

Wj  ~[/(20,  30) 

restricted 

unrestricted 

0.937 

(0.082.  4.697) 

30.250 

32.063 

1 

1  209 

(0.201.  3.253) 

20  125 

29  313 

10  7 

1 

1 

16 

16 

a,b,c,d,e, 

f,g 

wy  -1/(20.  30) 

restricted 

unrestricted 

2.844 

(0.190.  13  784) 

55.875 

. 

68.688 

3.690 

(0.276.  17.463) 

42  750 

77,750 

10  9 

16 

16 

a,b,c,d,e, 

f.g 

w,-~UaO.  30) 

restricted 

unrestricted 

4.409 

(0  291.  38.743) 

85.250 

93.625 

4.776 

(0.422,  32.535) 

54.500 

80.875 

15  5 

16 

16 

a.b.c.d.e, 

f.g 

w,  -  1/(33.  43) 

restricted 

unrestricted 

5.115 

(0.620,  18.384) 

53,750 

87.563 

i 

8.936 

(1  120,  45.392) 

46.375 

95.688 

|l5  2 

: 

! 

16 

16 

a,b,c,d,e. 

f.g 

w,  -0(33.  43) 

restricted 

unrestricted 

7.770 

(0  402,  22.011) 

76.875 

121  813 

1 

9.811 

(1.390.  23.729) 

50.250 

96  313 

15  9 

1 

16 

16 

a.b.c.d.e, 

f.g 

w,  -{/(33.  43) 

restricted 

unrestricted 

9.877 

(1  424,  31.649) 

1 

83.125 

91  750  1 

1 

13  860 

(3.561,  74.410) 

21.875 

44.438 

( 

20  5 

16 

15 

a,b,c,d,e, 

Tg 

w,  -0(45.  55) 

restricted 

unrestricted 

20.226 

(2  182.  63  068) 

78  600 

117.067 

42.075 

(5.571,  138  323) 

120  200 

210.133 
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TABLE  3  —  Approximate  Lagrangian  Procedure—  Results  for  Problem  Set  III  (r  restricted  only) 


j  Problem  |  Problems  j 
i  Size  1  AUempied  j 

In  ml  Solved  i 

!  Parametric  i 

Ranges  ^ 

1 

Bin  Capacity. 

T 

1 

1 

Time 

(Seconds) 

T  '  ■ 

1  Average  Number 
i  of  Nodes 

1 

1 

.Aicrage  Number  J 
of  Dual  j 

lleralions  1 

r 

t  . 

1  a.b,c.d.e,  i 

K,-  U(45.  5^1 

26  568 

7  90  000 

-1 

97  938 

1 

i  __  '<>  K* _ 

1 

(3  402.  90  836) 

1 

1 

1 

2  '  16  !  a.b.c.d.e,  ' 

H,-  C(5R.  fi8J 

47  984 

131  167 

207  750 

' 

-  U  '2  J 

i 

(3  546,  148  075) 

4 _  -  --  . 

._  _J 

'  2S 

g  1  g  1 

a.b.c.d.f,  ' 

lj(58.  68» 

15  493 

1  115.750 

125  250 

1 

8  1 

(6  420,  27  098) 

j 

'  .10 

5  i  8  1 

1  a,b.c,d,f.  1 

H,-  U(70.  801 

1 

23  617 

1  202-750 

271.875 

I 

;  8 

8  1 

(14  446,  45  554) 

' 

i 

1  30 

7  1  8  1 

1  a,b,c.d,r  j 

K,-  U(70.  80) 

1 

37  407 

1  374  250 

1 

480.250 

1 

i 

1  8 

!  g  1 

1 

(10  793,  91.0771 

1 

1  35 

5  1  * 

1  a.b.c.d.f.  1 

U  (83,  93) 

T~ 

53  194 

374.750 

656.375 

1 

1 

1 

i  * 

!  * 

1 

(10  667,  100  373) 

1 

1 

I  40 

5  8 

1  a.b.c.d.f.  ! 

K,-  U(9S.  105) 

“T 

49  607 

148  000 

n 

180.500 

1 

j 

i  8  _  _L 

I 

(31.770,  63.353) 

J _ 

u 

_ i 

1  45 

5  !  4 

!  a.b.d.f.g  1 

H,-  U(120.  130) 

1 

92  894 

1  364  500 

! 

426  250 

1 

L.  _ 

!  ■* 

i  1 

1 

(66  278,  125  692) 

1 

1 

1 

i 

Naturally,  the  performance  of  the  procedure(s)  tested  here  would  be  enhanced  by  direct  compari¬ 
sons  with  alternative  approaches.  The  difficulty,  however,  is  that  such  experience  does  not  appear  to  be 
available  relative  to  (GPBBl).  Of  course,  various  special  cases  have  been  examined  as  suggested  earlier 
and  to  this  extent  the  procedures  tested  here  may  be  applied.  This  sort  of  comparative  analysis  would 
suffer,  of  course,  since  the  attack  presented  here  would  carry  unnecessary  computational  burdens  not 
required  in  treating  such  special  cases.  In  short,  the  models  we  have  developed  most  fully  and  which 
have  been  tested  in  this  analysis  cater  to  the  full  interpretation  of  (GPBBl).  Resolution  of  special 
cases,  including  the  classic  bin-packing  problem,  can  be  accomplished  more  efficiently  by  alternative 
means. 

5.  SUMMARY 

Our  entire  concentration  in  this  paper  has  dealt  with  the  optimization  of  a  combinational  problem 
we  have  referred  to  as  generalized  bin-packing.  Of  course,  one  does  not  require  the  experience  of  this 
treatise  to  realize  the  difficulty  of  the  problem;  however,  the  empirical  evidence  gathered  in  this 
research  does  indeed  tend  to  dismiss  any  suspicions  to  the  contrary.  In  this  light,  a  logical  avenue 
would  likely  be  to  pursue  some  nonexact  approaches  to  the  problem  in  much  the  same  way  as  has  been 
done  for  the  classic  bin-packing  model. 

Of  the  various  heuristic  attacks  we  have  postulated  and  examined,  most  were  constructed  directly 
from  optimization  procedures.  Stated  alternatively,  specific  heuristics  involved  suitable  relaxation  of 
the  optimization  models  presented  here.  In  other  cases,  nonexact  schemes  were  developed  as  direct 
extensions  of  the  any-fit  heuristics  discussed  in  Section  1.  In  general,  procedures  of  the  first  category 
were  not  (empirically  speaking)  particularly  noteworthy  in  that  computational  effort  frequently  rivaled 
that  of  the  optimization  attacks  used.  In  the  latter  case,  the  more  ad  hoc  procedures  often  generated 
solutions  of  almost  arbitrarily  poor  quality  resulting  largely  from  an  inability  to  make  allowances  for  the 
unused  capacity  penalty. 

It  is  worth  adding  that  all  testing  of  heuristic  developments  has  been  empirical  in  that  no  worst- 
case  analysis  was  pursued.  That  this  was  the  case  is  based  not  on  a  lack  of  interest  per  se  but  rather  on 
the  view  that  such  results  may  be  void  of  the  sort  of  insight  presented  by  similar  results  for  the  classic 
bin-packing  problem.  Again,  the  issues  here  would  appear  to  relate  to  the  arbitrary  cost  parameter 
structure  of  the  problem. 
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ABSTRACT 

A  mean-variance  portfolio  selection  model  with  limited  diversification  is 
formulated  in  which  transaction  and  management  costs  are  incorporated  as  the 
sum  of  a  linear  cost  and  a  fixed  cost.  The  problem  is  a  fixed  charge  inieger 
programming  problem  solved  by  hypersurface  search  using  dynamic  program¬ 
ming.  Fathoming  is  performed  in  the  forward  pass  of  dynamic  programming  so 
that  values  of  the  state  variable  which  correspond  to  infeasible  solutions  are  el¬ 
iminated  from  the  tables.  This  logic  permits  the  solution  of  problems  with  20- 
30  possible  investments. 


INTRODUCTION 

This  paper  presents  a  portfolio  selection  model  for  the  small  investor.  A  basic  reference  for  the 
portfolio  selection  problem  is  the  book  of  Markowitz  [4],  He  presents  the  portfolio  selection  problem 
in  which  there  are  two  criteria  for  the  best  portfolio:  maximum  return  and  minimum  variance.  Port¬ 
folios  are  called  efficient  if  they  meet  either  of  the  following  conditions:  if  for  a  fixed  variance  the 
return  is  maximized,  or  if  the  return  is  fixed  and  the  variance  is  a  minimum.  Sharpe  |7]  subsequently 
introduced  some  simplified  models  in  which  returns  on  securities  are  functions  of  the  performance  of 
one  or  more  market  indices.  Recently  Pang  (6)  has  formulated  and  solved  the  portfolio  analysis  prob¬ 
lem  for  large  scale  continuous  variable  problems  by  using  a  modified  version  of  the  parametric  principle 
pivoting  algorithm  for  the  class  of  multiple  index  portfolio  problems  with  positive  covariance  matrices. 
In  the  present  paper  we  follow  a  different  line  of  research.  It  has  been  shown  by  Jacob  [3]  that  the 
solution  of  portfolio  analysis  problems  by  standard  techniques  generates  answers  which  are  inappropri¬ 
ate  for  the  small  investor  because  of  the  large  number  of  positive  amounts  of  different  investments 
which  they  contain.  The  individual  investor  might  prefer  to  limit  the  total  number  of  his  investments  so 
that  he  does  not  have  to  pay  extra  charges  on  odd-lot  sales  and  keep  track  of  the  performance  of  a  great 
number  of  different  securities.  Jacob  formulated  several  single  index  models  of  the  portfolio  analysis 
problem  in  which  the  total  number  of  different  investments  is  contrained.  This  results  in  an  integer 
programming  problem.  Faaland  solved  another  integer  limited-diversification  model  parametrically  to 
generate  the  set  of  efficient  portfolios  [21.  Our  work  extends  his  model  by  allowing  transaction  and 
management  costs  to  be  expressed  explicitly  as  the  sum  of  linear  and  fixed  charges  in  the  objective 
function.  Then  we  use  a  technique  for  nonlinear  inieger  programming  which  is  based  on  dynamic  pro¬ 
gramming  [1]. 

Notation  for  the  Limited  Diversification  Model 

Let  us  follow  the  notation  for  Sharpe’s  single  index  portfolio  model.  The  return  for  security  ,i  is 
Rj  and 
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Rj  =  £•,  +/3j/  -  £(/))+€, 

where  £,  is  expected  return  on  security  J,  I  is  the  current  value  of  the  index  and  Ed)  is  its  expected 
value;  pj  is  a  parameter  and  (j  is  a  random  error  term  with  mean  zero  which  is  independent  of  /. 
Define  a^d)  and  as  the  variances  of  those  variables  and  assume  that  cov(e,,  e^)  is  zero  if  /  ^  J. 

Following  Faaland,  the  integer  problem  in  which  the  maximum  variance  B,  is  a  parametric  quantity  is 


n 


(1.1)  maximize 

I  CjXj 

7-1 

n 

(1.2)  subject  to 

I  QjX/  <  B, 

J-1 

(1.3) 

±X^^k 

/-I 

(1.4) 

At  most  T  of  the  Xy  may  be  positive 

(1.5) 

Xy  >  0 

7=1 . n 

(1.6) 

Xy  integer 

7=1 . n 

Let  us  define  Cj  =  Ejk,  j  =  1 . n 

Oj  =  Pj  ad)/ k,  j  =  1 . n. 

Qj  =  (T^(fj)/k^,  j  =  \ . n 

B,  =  cr},  r  =  1 . p. 

The  new  problem  includes  a  commission  charge  with  a  fixed  minimum  plus  a  component  which  is 
linear  in  the  investment  amount.  Then  (1.1)  becomes 

(1.7)  maximize  ^  (Cy  —  Fy)jir^  —  Mfiixj) 
y-i 

where  Vj  is  the  linear  charge,  Mj  is  the  fixed  charge,  and 

I  1  ifxy  >  0 
“jo  ifXy  =  0. 

THE  INTEGER  ALGORITHM 

The  problem  described  above  (1.2)-(1.7)  has  several  difficult  aspects.  In  (1.2)  the  representation 
of  portfolio  variance  is  a  nonseparable  function.  One  method  which  can  handle  such  functions  is  a 
nonlinear  integer  programming  technique  called  hyperplane  search  by  dynamic  programming.  Let  us 
outline  this  approach.  The  general  method  is  to  solve  a  sequence  of  problems  of  the  following  form 
which  differ  only  in  the  value  of  the  right  hand  side  by  dynamic  programming. 

maximize  z  =  Z  /)(x^) 

I  fjiXj)  -  z* 

0  <  /,  <  X,  ^  y  =  I . n 

X,  integer  y  =  1 . n. 
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This  problem  can  be  used  to  identify  all  integer  points  on  any  hypersurface  level  c*  .  If  is  an  upper 
bound  on  the  objective  function  value,  then  the  solution  of  the  single  dynamic  programming  problem 
above  can  be  used  to  generate  all  integer  points  upon  the  hypersurface  level  1,/^  =  cj,  and  for  all  right 
hand  side  values  less  than  z*.  In  fact  it  gives  information  as  to  which  values  of  z  contain  integer  points 
so  that  hypersurfaces  with  no  such  points  need  never  be  considered,  see  [1].  We  can  use  this  tech¬ 
nique  on  (1.7).  Then  the  constraints  of  the  original  problem  (1.2)-(1.6)  are  used  to  test  the  feasibility 
of  any  candidate  integer  points,  so  that  the  intractability  of  (1.2)  can  be  overcome.  If  we  choose  the 
sequence  (z^l  in  decreasing  order  (or  generate  integer  points  on  hypersurface  levels  with  decreasing 
objective  function  values)  then  the  first  feasible  point  is  optimal.  If  Iz^l  is  decreasing  then  this  method 
is  a  partial  enumeration  since  no  integer  points  are  considered  which  have  objective  function  values  less 
than  the  optimal  solution.  This  strategy  is  a  direct  application  of  the  hyperplane  search  method  to  lim¬ 
ited  diversification  portfolio  analysis.  The  complicated  form  of  the  objective  function  (1.7)  presents  no 
difficulty. 

A  Second  Approach 

Since  the  straightforward  method  given  above  requires  too  much  storage,  the  usual  method  of  cal¬ 
culating  the  dynamic  programming  recursion  tables  ("pulling")  was  changed  to  a  second  approach  which 
uses  a  technique  called  "pruning"  or  "fathoming"  to  reduce  the  size  of  the  tables  by  allowing  infeasible 
points  to  be  eliminated  during  the  forward  pass.  In  this  way,  the  constraints  (1.2)-(1.4)  can  be  used  to 
reduce  computation  and  storage.  The  idea  assumes  that  coefficients  in  the  constraints  are  positive. 
Under  this  assumption,  if  we  have  a  partial  solution  for  which  the  portfolio  variance  constraint  (1.2)  (or 
(1.3)  or  (1.4))  is  exceeded,  then  we  "prune"  the  entry  or  value  of  element  x,  which  caused  the  infeasi¬ 
bility  since  no  completion  of  that  partial  solution  will  be  feasible  [5).  This  method  of  fathoming 
reduces  both  the  storage  and  computational  burden  of  the  algorithm.  It  supplants  the  method  of  check¬ 
ing  partial  solutions  for  infeasibility  at  every  stage  in  the  backward  pass  of  dynamic  programming. 

Implementation  of  the  Parametric  Right>Hand-Side  Feature 

The  problem  (1.2)-(1.7)  has  a  parametric  right  hand  side  so  that  efficient  portfolios  are  generated 
for  various  upper  bounds  on  portfolio  variance.  Previous  work  [ll  has  shown  that  such  parametric 
problems  can  be  solved  from  a  single  dynamic  programming  problem  using  hypersurface  search, 
because  of  a  similar  property  obtained  in  any  dynamic  programming  solution.  If  an  allocation  problem 
is  solved  for  a  given  right-hand  side  value,  then  by  completing  the  last  return  function  table  for  all 
values  of  the  state  variable  all  problems  of  identical  form  but  with  smaller  right  hand  side  values  can  be 
obtained. 

Computational  Results 

The  model  selected  for  computational  results  is  that  given  in  (1.2)-(1.7),  a  nonlinear,  nonsepar- 
able  parametric  fixed  charge  problem.  Problem  coefficients  were  generated  randomly  for  various 
numbers  of  total  possible  investments  from  10-30.  The  times  for  computer  runs  on  the  CYBER  73  are 
given  in  the  following  table  of  summary  statistics.  For  each  problem  size,  statistics  are  averaged  for  ten 
problems  with  random  coefficients.  The  second  through  the  tenth  problems  are  parametric  versions  of 
the  first  and  therefore  take  less  time. 

This  problem  is  highly  specialized.  Therefore  it  is  difficult  to  find  computational  comparisons  with 
it  in  the  literature.  However  some  recent  work  on  a  linear  integer  bicriteria  problem  [81  may  be 
relevant.  Villarreal  and  Karwan  have  given  results  for  three  ten-variable  problems  with  four  constraints 
which  are  solved  in  an  average  of  18.43  CPU  seconds  each  on  a  CDC  6400  computer  system.  Their 
algorithm  generates  all  distinct  efficient  points  (an  average  of  7)  in  comparison  to  ours  which  generates 
efficient  points  according  to  ten  desired  levels  of  portfolio  variance.  These  authors  obtain  additional 
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TABLE  1  —  (fib-  in  CPU  Seconds) 


Portfolio 
Variance  =  B, 

n  =  10 

n  =  20 

n  =  30 

1.5 

0.1/.06 

6.217.3 

34.74/36.43 

1.35 

.OO/.OO 

.Ol/.Ol 

.01/.02 

1.2 

.Ol/.Ol 

.08/. 19 

.08/  15 

1.05 

.OO/.Ol 

.04/.09 

.54/. 85 

.9 

.OO/.Ol 

.28/.49 

.19/. 44 

.75 

.01/.02 

.43/. 64 

2.59/4.62 

.6 

Ol/.Ol 

1.12/2.06 

.81/1.20 

.45 

.01/.02 

1.04/1.68 

8.78/11.47 

.3 

.02/.03 

2.38/4.29 

3.64/6.85 

.15 

.02/.02 

2.36/3.16 

11.96/13.32 

improvements  with  various  hybrid  dynamic  programming  recursions,  and  get  their  best  times  (1.19 
CPU  seconds  for  ten  variables)  with  a  branch  and  bound  approach.  Our  method  seems  competitive, 
particularly  when  the  nonlinearities  and  nonseparability  of  the  portfolio  problem  are  considered,  along 
with  the  larger  size  of  problems  solved.  It  is  difficult  to  make  a  more  exact  comparison  because  of  the 
different  structure  of  the  problems  in  (8)  and  the  portfolio  problem. 
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ABSTRACT 

In  this  paper  the  n/l/r,  ^  O/E  h',C,  problem  under  the  assumptions  of 
J 

nonpreempiive  sequencing  and  sequence  independent  processing  times  is  inves¬ 
tigated  After  pointing  out  the  fundamental  properties,  some  dominance 
sufficient  conditions  among  sequences  arc  obtained  and  a  branch  and  bound  al¬ 
gorithm  is  proposed  Computational  results  are  reported  and  discussed 


1.  INTRODUCTION 

The  one  machine  scheduling  problem  has  been  studied  extensively  under  different  hypotheses  and 
objective  functions.  Nevertheless,  in  the  literature,  emphasis  is  laid  upon  the  case  with  equal  ready 
times,  no  imposed  due  dates  and  flow  time  (or  equivalent  function)  as  objective  function.  In  this  con¬ 
text  the  problem  solution  is  trivial  since  the  Smith’s  rules  (14],  well-known  as  shortest  processing  time 
(SPT)  rule  and  weighted  shortest  processing  time  (WSPT)  rule,  provide  an  optimal  solution.  Accord¬ 
ing  to  these  rules,  jobs  are  sequenced  on  the  basis  of  a  preestablished  order  of  their  processing  times. 

In  (7),  [8],  (12],  (13],  and  (15],  a  more  sophisticated  cost  function,  such  as  weighted  tardiness,  is 
adopted  for  it  turns  out  to  be  a  suitable  model  for  many  real  problems  in  which  jobs  may  be  considered 
available  simultaneously  for  processing  on  a  single  machine  in  order  to  minimize  the  total  sum  of  the 
delays  with  respect  to  their  due  dates.  One  of  the  most  constraining  assumptions  is  equality  of  the 
ready  times  of  the  jobs 

As  a  matter  of  fact,  whenever  different  ready  times  are  to  be  considered  (r,  >  0),  the  previous 
models  and  the  corresponding  algorithms  are  no  more  adequate. 

The  case  r,  ^  0  has  been  recognized  in  research  on  other  single  machine  problems,  where  due 
dates  were  also  taken  into  consideration  (ll,  (51,  and  UOl.  However,  in  these  works,  the  authors  inves¬ 
tigate  the  properties  of  a  rather  different  cost  function  given  by  the  maximum  lateness  (or  tardiness) 
with  weights  w,  =  1  for  all  j. 

In  (2],  (3l,  [4],  and  (11],  refet'ing  to  scheduling  theory  application  to  air  traffic  control,  minimiz¬ 
ing  total  weighted  waiting  time  in  one  machine  scheduling  problem  with  unequal  ready  times  has  been 
considered. 

An  implicit  enumeration  procedure  has  been  developed  and  a  simplified  algorithm  (w^  -  l.Vy) 
has  been  implemented  in  order  to  work  in  real  time.  The  interested  reader  may  address  to  references 
(111  and  (4]  for  a  complete  discussion  of  this  application.  The  aforementioned  scheduling  problem, 
although  simple  in  these  terms,  is  not  trivial  and  it  may  be  proved  to  be  NP-complete  (9].  The  main 
difficulty  arises  from  the  fact  that,  since  r,  ^  0,  idle  times  may  be  inserted  in  the  optimal  schedule  (6]. 
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In  this  paper,  starting  from  the  results  obtained  in  previous  works  of  ours  [2],  [3],  (4),  and  [11] 

the  «/l/r,  ^  0/  Z  problem,  with  no  preemption  allowed  and  sequence  independent  processing 

./ 

times,  is  analyzed.  The  purpose  is  to  establish  some  further  dominance  properties  and  above  all  to 
present  a  much  improved  branch  and  bound  algorithm.  Computational  results,  obtained  with  this  algo¬ 
rithm,  are  also  reported  and  discussed. 


2.  DEFINITIONS  AND  PROBLEM  STATEMENT 

Let  N  =  {j\j  =1.2 . n)  be  a  set  of  jobs  to  be  processed,  one  job  at  a  time,  on  a  single,  con¬ 

tinuously  available,  machine.  For  each  job  j,  the  ready  time  r,,  the  processing  time  p,,  and  the  weight 
H',,  are  given. 

We  call  sequence  on  set  K  C  /V  any  permutation  in  K  which  is  indicated  with  s*,  being  Sq  the 
sequence  on  the  void  set.  Completion  of  all  jobs  requires  establishing  a  sequence 

s„  =  (,/ji].,/(2| . 7j„|).  When  a  job  parameter  is  identified  by  the  job’s  position  in  a  given  sequence 

rather  than  its  index  number,  the  position  is  indicated  in  square  brackets.  Thus  Cj,]  means  the  comple¬ 
tion  time  of  whichever  job  occupies  the  ^h  position  in  the  sequence. 

Suppose  now  that  a  sequence  is  constructed  by  adding  one  job  at  a  time,  starting  from  position  1. 

At  any  step,  we  have  a  partial  sequence  s*  =  (yju.yjji . j\kO  of  ^  Job  set  K  C  N.  It  can  easily  be 

verified  that,  in  our  setting,  the  completion  time  of  such  a  sequence  cannot  be  expressed  only  in  terms 
of  processing  times  regardless  of  the  order  in  which  jobs  are  scheduled,  since  idle  times  can  be  inserted 
[6].  This  fact  suggests  some  useful  definitions. 

_  DEFINITION  1:  Given  a  partial  sequence  s^.,  the  earliest  start  time  of  a  job  J  €  A,  where 
A  =  yV  -  A  is  the  set  of  jobs  not  sequenced,  can  be  expressed  as 

(1)  =  max  (r,,  Ci*|) 
with  r,  =  r,.  Vj  (i  N,  if  k  =  0. 

DEFINITION  2:  Given  a  partial  sequence  s*,  the  completion  time  of  a  job  j  €  K.  C,{Sk).  can  be 
expressed  as 

(2)  Ci(Sk)  =  i/iSk)  -I-  Pi- 

DEFINITION  3:  Let  K  be  any  subset  of  N  with  cardinality  k.  then  the  cost  function  associated  to  s*  is 
CMsJ  =  £  w,,|C,,|. 

If  k  =  n,  C''(s„)  expresses  the  total  weighted  completion  time  of  s„. 

Then,  the  problem  can  be  formally  stated  as  follows:  Given  a  job  set  N,  find  a  sequence  s*  such  that 
C“(s:)  <  C’’(s„),  Vs„  €  Sn. 


To  fully  understand  the  problem  framework,  it  is  suitable  to  introduce  the  reduced  problem  con¬ 
cept.  A  definition  of  reduced  problem  may  be  found  in  references  [7]  and  [15].  However,  that 
definition  is  given  by  considering  the  sequence  completion  time  as  computed  regardless  of  the  order  of 
the  jobs.  As  pointed  out  before,  this  is  impossible  in  our  setting;  a  more  general  definition,  which  also 
holds  in  our  hypotheses,  is  the  following. 
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DEFINITION  4;  A  problem  is  said  to  be  reducible  if  a  partition  (yV).  ^2)  of  set  /V  exists  such 
that,  if  sj,  sf,  $2  are  the  optimal  sequences  with  respect  to  C"!-)  on  N,  Ai.  N2,  respectively,  it  is  true 
that  s*  =  sf  sj. 

REMARK  1:  It  is  easy  to  prove  that  a  given  problem  n/\/r,  >  O/I  WjCj  is  certainly  reducible  if 

j 

a  job  set  iV,  and  a  corresponding  optimal  sequence  S|  exist  such  that  completion  time  of  s,  is  less  than 
(or  at  most  equal  to)  the  earliest  ready  time  associated  with  N2. 

In  [11]  a  simple  decomposition  method  to  transform  a  given  problem  into  reduced  problems  is 
outlined. 

Therefore,  without  loss  of  generality,  from  now  on,  only  reduced  problems  will  be  dealt  with. 

DEFINITION  5:  Given  a  partial  sequence  s*,  a  Job  set  V  (Z  K  is  said  to  be  "dense"  if  any 
sequence  j\.  j2.  ...  7).,  on  Fhas  no  inserted  idle  times. 

It  is  easy  to  verify  that 

7  6  F  iff  t  isi,)  <  min  CAsl). 

jiK 

DEFINITION  6;  Given  two  sequences  s*  and  s,,  both  belonging  to  set  S,  s*  dominates  S/,,  if  and 
only  if,  it  is  true  that  there  exists  an  such  that  s*  s„_n  =  s'  belongs  to  and 

<  CCsnlsft)  for  all  s„|s/,. 

DEFINITION  7.  Two  sequences,  s*  and  s,,,  are  equivalent 
if  C"(s;|s*)  =  C“(s:ls,). 

REMARK  2;  On  the  basis  of  the  previous  definitions  it  follows  immediately  that  an  optimal 
sequence  sj  dominates  any  other  one,  and  it  is  equivalent  to  all  its  own  partial  sequences  s^, 
/c  =  1,  2 . n. 

DEFINITION  8;  A  sequence  is  said  to  be  an  ECT  sequence  if  it  satisfies  the  earliest  completion 
time  (ECT)  rule,  i.e.,  if  Cn  +  n  =  min  C,(sn),  0  <  k  ^  n-\. 

UK 

Ties  are  broken  by  choosing  j  with  min  r,  and  further  ties  by  choosing  ,/  with  max  w,  and,  at  last, 
by  choosing  j  with  min  j. 

DEFINITION  9:  A  sequence  is  said  to  be  an  EST  sequence  if  it  satisfies  the  earliest  start  time 
(EST)  rule,  i.e.,  if  each  actual  job  start  time  is  such  as 

=  min  0  <  ^  1. 

JiK 

3.  OPTIMALITY  AND  DOMINANCE  CRITERIA 

In  this  section  a  set  of  properties,  which  enhance  the  efficiency  of  the  search  for  an  optimal  solu¬ 
tion,  is  given.  Some  of  these  properties  (specifically  those  resulting  from  Theorems  2,  3,  4,  5  and 
Corollary  2)  have  already  been  demonstrated  in  a  previous  paper  [111.  The  other  properties,  expressed 
as  Theorems  1,6,7  and  Corollaries  1  and  3,  have  here  been  carried  out  in  order  to  improve  the  algo¬ 
rithm  performance.  Extended  proofs  of  these  further  properties  are  reported  in  the  Appendix. 
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THEOREM  1:  Given  a  job  set  A/ and  a  partial  sequence  s*(/c  <  n).  if,  for  /  €  K 

Pi  Pi  — 

(a)  —  Vj  €  K 

W’l 

(b)  f,(sj  <  h  ^  K 

then  Sj/ "dominates"  St^h. 

Clearly,  the  property  expressed  in  Theorem  1,  in  the  case  with  k-,  =  1  Vi,  can  be  applied  in  the 
following  way:  "Given  a  partial  sequence  s*,  consider,  among  all  the  remaining  jobs,  job  i  with  the  least 
processing  time  and  then  take  out  of  consideration  for  A  +  1th  position,  all  jobs  unable  to  start  before 
job  /.  This  property,  of  course,  appears  to  be  efficient  since  earlier  job  i  is  able  to  start.  For  instance,  if 
job  I  has  both  minimum  processing  time  and  earliest  start  time,  no  other  job  may  be  considered  for  pos¬ 
sible  inclusion  in  the  (k  +  1 1th  position  in  the  sequence. 

COROLLARY  1;  An  EST  sequence  of  job  set  A  is  optimal  if  it  is  ordered  according  to  WSPT 

rule. 

THEOREM  2:  Given  a  job  set  A,  a  partial  sequence  (k  <  n)  and  two  jobs  ij  €  K.  If 
r,  ^  C|(s*),  then  s^/dominates 

From  this  theorem  it  follows  that  is  is  possible  to  restrict  search  for  optimal  solution  to  the  class 
of  active  schedules  (II. 

In  Figure  I  an  example  of  the  application  of  Theorem  2  is  given. 


Scheduled  jobs 


1 

1 

1 

1 

! 

K-  (h,i,l,m) 

h 

□ 

h,l  are  candidates 

1 

• 

1  i 

j 

1 

i  1 

1  1 

l.m  are  not  candidates 

>  I 


— I - 

min  C  (s  ) 
jEK  J 


time 


FiouRt  1  Example  of  application  of  Theorem  2;  candidates  to  ( A  +  Dili 
position  in  the  sequence 


THEOREM  3;  Given  s*  and  two  jobs  i.  j  €  A,  if 


(a)  tv,  ^  IV, 

(b)  C,(s*)  ^  C,(s*) 

(c)  tV,(C,(s*)  -  C,(5t)l  +  IV, p,  -  IV, p,  >  (p, 
where  0  is  the  set  K  —  (/, ./),  then  s*/' dominates  s*/ 


P,)  L 

HQ 

i 
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THEOREM  4;  Given  s*  and  two  jobs  /.  j  €  K,  if 

(a)  w,  ^  tv, 

(b)  C,(sJ  ^  C,(sv) 

(c)  w,p,  -  ^  lC,(s^)  -  C  ,(sjl  ^  vv,  +  8_,  (p,  -  p,) 

itk 

■  wy-  wJ 

/€A:  j 

then  dominates  s^j. 

Properties  resulting  from  Theorems  3  and  4  may  be  practically  utilized  in  order  to  operate  a 
further  selection  within  the  class  of  active  schedules.  Of  course,  from  a  theoretical  point  of  view  they 
could  also  be  applied  within  the  more  general  class  of  permutation  schedules.  It  is  easily  seen  that,  in 
general,  the  dominance  concept,  as  expressed  in  Theorems  3  and  4,  depends  not  only  on  the  input  vari¬ 
ables  (weight  coefficients,  processing  and  ready  times)  associated  to  the  couple  of  jobs  taken  into  con¬ 
sideration  but  also  on  the  input  variables  associated  to  the  remaining  jobs  (specifically  on  their  weight 
coefficients  or  on  their  number  if  w,  =  I  Vi), 

THEOREM  5:  Given  a  job  set  Nand  two  partial  sequences  x-iy/and  xjyi.  If 

(a)  iyj  and  jyi  have  no  inserted  idle  times 

(b)  lC,(x)  -  C,(x)l  ^  Wi  +  (w,  -  H.yl  X,  p,  +  WjPi  -  WiPj 

KP  HP 

>  8-1  (/,(x)  -  r,(x)l  ^  IV, 

/  •<  L 

where  P  is  the  job  set  of  iyJ  and  (/  the  set  N  -  P  -  |x  1 ,  then  xiyj  dominates  xjyi. 

COROLLARY  2;  Given  s*  and  two  jobs  /.  j  €  K.  Let  K  be  the  set  "dense,"  if 

(a)  IV,  ^  w, 

(b)  /,(s*)  >  /,(s*) 

(c)  C,(sJ  ^  r,(s») 
then  s*/ dominates  sj. 

THEOREM  6;  Given  a  job  set  N  and  a  partial  sequence  <  n),  for  any  sequence  (s„ls*)  a 
lower  bound  C'“(s„  I  s*)  <  C"( 1 5*)  exists,  where 

C-Xsjs*)  =  C"(s*)  -E  "f 
;-l 

/-I  /-iv! 

-  8-1  (C*  +  ,(s*)  -  Ct,.,(s*)])  •  min(»’*  +  ,.  ivt+,), 
being  k  +  ;  and  k  +  /  6  A  with  <  r^^i. 

The  first  term  of  the  lower  bound  provided  by  Theorem  6  is,  simply,  the  value  of  the  objective 
function  associated  withjhe  partial  sequence  s*.  The  second  term  takes  into  account  that  each  job  of 
the  complementary  set  K  cannot  start  before  its  earliest  start  time.  The  third  term  takes  into  account 
that  processing  times  of  the  unscheduled  jobs  are  not  to  be  overlapped  in  the  final  sequence. 
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An  example  of  application  of  this  lower  bound  is  reported  in  Figure  2. 


+  0^  •  min(W  ,W,)  +  O  •  mln(W  ,W  ) 
hi  h  1  11  11 


FioLRt  2  Example  of  lower  bound  compulation  for  a  pariial  sequence  s* 

THEOREM  7:  Let  s®  be  a  sequence  on  Afsuch  that,  for  0  ^  /c  ^  —  1 

(a)  +  =  Ci^i 

(b)  C(*  +  ||  =  min  C/fs*) 

/€! 

(C)  IV(*1  >  lVn,.,| 

being  C(o|  =  r,.  i  €  N,  then  s°  dominates  any  other  sequence  s“  of  the  same  set  starting  with  job  h 
such  as  r,  <  r^. 

COROLLARY  3.  An  ECT  sequence,  defined  on  a  job  set  -Vand  ordered  according  to  nondecreas¬ 
ing  weights,  is  optimal  if  it  has  no  inserted  idle  times  and  begins  with  the  job  having  the  smallest  ready 
time. 

4.  THE  PROPOSED  APPROACH 

To  solve  the  problem  under  study,  an  implicit  enumerative  procedure  is  proposed,  based  upon  a 
branch  and  bound  concept.  It  involves  a  search  along  the  branches  of  a  tree,  in  which  any  node  at  level 
k  represents  a  partial  sequence  s*  defined  on  a  set  K  with  cardinality  k. 

As  a  consequence,  subsequent  jobs  in  Sj  correspond  also  to  a  succession  of  nodes  encountered  by 
going  from  tree  root  Sq  to  node  s*.  For  each  node  s*,  as  illustrated  in  Section  3.  a  lower  bound 
C"(5,|.5*)  on  r"(s„Un)  is  computed,  and  the  current  optimal  solution  is  utilized  as  upper  bound.  At 
each  iteration,  the  node  being  expanded  is  called  the  current  node,  whose  last  job  has  the  lowest 
current  value  of  earliest  start  time.  A  closed  node  is  one  whose  correspondent  partial  sequence  has 
been  dominated  and,  therefore,  it  will  be  no  longer  considered.  To  identify  dominated  nodes,  some 
elimination  conditions  based  upon  the  previous  theorems  are  then  applied.  In  this  way,  the  number  of 
nodes  branching  from  the  current  node  can  be  effectively  reduced. 
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The  proposed  procedure  is  a  recursive  one.  In  fact,  whenever  ^partial  sequence  s*  on  /V  has  been 
fixed,  the  optimal  sequencing  problem  of  the  jobs  belonging  to  set  K  is  identical  to  the  initial  problem 
if  their  ready  times  are  shifted  so  that  none  of  them  is  less  than  Cfs*). 

5.  THE  ALGORITHM 

The  algorithm  consists  of  three  basic  phases:  initialization,  branching  and  termination.  Initializa¬ 
tion  involves  defining  initial  value  of  the  variables,  determining  an  initial  sequence  and  testing  it  for 
optimality. 

Branching  is  an  iterative  procedure.  In  each  iteration,  it  generates  the  descendents  of  the  current 
node,  eliminates  closed  nodes,  updates  the  set  of  job  candidates  to  successive  position,  identifies,  as 
next  current  node,  that  one  having  last  job  with  lowest  earliest  start  time,  and  computes  the  lower 
bound  for  it. 

The  termination  phase  consists  of  identification  of  the  optimal  sequence  s*  and  computation  of 

Initialization 

(1)  Let  Abe  the  job  set  [j\j  =1,2 . «1.  Define  the  corresponding  parameter  sets: 

^  =  I'’!,  zj . r, . r„).  P  =  lp|,  P2 . p„]. 

=  (wi.  W2 .  ^n)  where  z,.,  ^  z,  ^  z,+  ,. 

If  Zy  =  z,  +  |  call  y  job  with  min  p, 

(2)  A:  -  0.  A  =  0.  s*  =  0,  A  =  A.  C(s*)  =  z,. 

(3)  Construct  an  initial  solution  s„  by  means  of  an  heuristic  rule  (for  example  EST  or  ECT)  and 
compute  C"(!„). 

(4)  Test  initial  solution  for  optimality  by  means  of  Corollary  1  and  Corollary  3.  if  s„  passes  the 
test  go  to  Step  22. 

Branching 

(5)  Determine  i,(s*)  =  max  (z,.  C(s*)l  and 

UK 

C„(sJ  =  min  (l/(s*)  -(-  p,} 

/(A 

(6)  Form  the  set 

O,,  -  {J\j  €  K.  r/s*)  <  C„(s*)l 

(7)  if  Dj^  =  A  go  to  Step  18 

(8)  Generate  nodes  s^j,  V  j  ^ 

(9)  For  each  node  s*y  dominated  according  to  some  of  the  Theorems  1,  3,  4,  5  (and  Corollary  2 
(if  Step  8  has  been  reached  coming  from  Step  19)),  update  set  by  setting 

D,  =  D,  —  j 
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(10)  Sel  k  =  k  +  \ 

(11)  If  I  =  0,  set  <:  =  A:  —  I  and  go  to  Step  16 

(12)  Set  D,  .  —  —  J',  5*  =  Sj-ij',  K  =  is*l,  K  —  N  —  K  with  y':  r,  =  min  /,(s*-|) 

A  -  !  A  - 1  /to. 

‘a- I 

(13)  Compute  lower  bound  C'"(5„U*) 

(14)  If  C“(sjs*)  ^  C“(s„)  go  toStep  11. 

(15)  Set  C(sa)  =  /,  +  p;  and  go  to  Step  5. 

(16)  If  /c  =  0,  go  to  Step  22. 

(17)  Go  to  Step  11. 

(18)  If  t,(sj  =  t.  Vj  C  iC.  go  to  Step  20. 

(19)  Go  to  Step  8. 

(20)  Form  the  sequence  where  is  the  sequence  on  K  satisfying  the  WSPT  rule. 

(21)  Set  s„  =  C^Xs,)  =  C"(s„ls*)  and  go  to  Step  11). 

Termination 

(22)  Set  s*  =  s„  and  C'is*)  =  CXs,).  STOP. 

Figure  3  shows  a  synthetic  flow-chart  which  illustrates  the  algorithm  logic. 

6.  COMPUTATIONAL  RESULTS  AND  CONCLUSIONS 

The  algorithm,  previously  illustrated,  has  been  implemented  on  a  Digital  PDP  11/34  minicom¬ 
puter. 


Different  series  of  tests  have  been  performed  by  considering  a  given  number  of  jobs  n  —  10.  For 
each  series  the  p,  values  have  been  chosen  uniformly  distributed  over  a  fixed  interval  [1  -h  7]  and  the  ^ 
values  have  been  chosen  uniformly  distributed  over  five  different  lime  intervals  of  growing  length. 
Moreover,  for  each  fixed  range  of  r,,  the  weights  have  also  been  taken  as  uniformly  distributed  over 
ten  different  intervals  of  growing  length.  The  results  are  fully  reported  in  Table  1  while  in  Figures  4 
and  5  are  reported  the  diagrams  showing  the  number  of  nodes  generated  versus  max  r,  and  max  Wy, 
respectively.  As  it  can  be  observed  a  significant  increase  in  all  the  parameters  is  met  whenever  weight 
coefficients  tend  to  be  unequal.  In  particular,  computational  results  show  that  changes  in  the  observed 
parameters  are  relatively  less  dependent  on  the  width  of  the  weights  range.  Rather,  these  changes  seem 
to  arise  whenever  a  mere  difference  among  the  weights  is  considered.  Figure  4  and  5  show  this 
behavior  for  the  number  of  nodes  generated. 

Nevertheless,  the  computation  time,  in  the  worst  case,  does  not  exceed  a  few  tens  of  seconds, 
even  though  a  fast  computer  has  not  been  used. 

A  last  remark,  involves  the  fact  that  the  typical  behavior  (bell-shaped),  observed  in  (41  when 
w,  =  1 ,  V  j,  is  confirmed  also  if  are  distributed  on  a  fixed  interval  (Figure  4). 
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TABLE  1  —  Summary  of  the  Results  in  the  Case  n  =  10  and  Range  ofPi 
Number  of  Tests  for  Each  Pair  of  Intervals  Relative 
to  tj  and  Wj  =  100.  Total  Number  of  Tests  Performed  =  5000 


Range  of 


nodes  generated 


f-iouRE  4.  Nodes  generated  versus  maximum  ready  time 


FiotRE  5  Nodes  generated  versus  maximum  weight 


In  conclusion,  the  proposed  algorithm  appears  to  behave  in  a  satisfactory  way,  at  least  when  the 
number  of  jobs  does  not  exceed  10. 

However,  to  fully  evaluate  the  algorithm  performance,  more  extensive  tests,  by  increasing  the  job 
number  and  the  width  of  the  weight  ranges,  by  considering  also  processing  time  ranges  of  different 
width  and  by  making  use  of  a  fast-time  computer,  should  be  provided. 
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APPENDIX 


Proof  of  Theorem  I : 

Consider  a  schedule  s„  =  having  h  in  position  k  -E  1  and  /  in  position  ^  >  A-  -E  1.  Let  s’ 

be  a  schedule  that  differs  from  s„  only  in  that  Job  /  is  interchanged  with  its  immediate  predecessor  /  in 
position  V  -  1.  The  first  (,v  -  2)  jobs  have  exactly  the  same  completion  times  under  s„  and  and. 
therefore  make  the  same  contribution  to  total  weighted  completion  time. 

Schedules  5„  and  differ  only  in  the  completion  times  of  the  (n  -  v  -E  2)  remaining  Jobs. 

As  far  as  the  last  (n  -  v)  Jobs  are  concerned,  which  are  the  same  in  both  sequences,  it  can  be 
derived  that 

(1)  AC"(>')=  t  >v,„C,„(s„)-  £  wp,C|„(s;^0. 

/-r+i  /->  +  ! 
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In  fact,  by  assumption  b),  it  follows  that 

=■  max  r,)  +  p,  +  p,. 

Ci>,i(5',)  =  maxl(Ci^_2i  +  Pi),  rj]  +  pj. 
where  j  is  the  job  in  position  >  —  1 .  Hence,  it  is  easy  to  verify  that 

from  which  the  (1)  immediately  follows. 

Let’s  examine  now  contribution  of  jobs  h  and  /.  If  we  call  t  =  C^y_2](i„),  then  their  weighted 
completion  times  can  be  expressed  by  the  following: 

Under  s„  : 

WjCj  =  =  w,{max(r,  rj)  +  Pj] 

WiCi  =  =  w^fmax  (r,  r,)  +  pj  +  p,); 

Under  s^J: 

WjCj  =  =  wy  {[max  (/  +  p,),  tj]  +  pj] 

w,C,  =  =  Wiit  +  Pi), 

since,  by  assumptions  b),  /■,  <  ti  ^  i,,  <  t. 

Two  cases  exist: 

(1)  rj  <  t 

Eliminating  common  terms,  from  the  (1)  and  assumption  (a),  it  follows  that 

-  C'“(s!,)  =  AC’O')  +  iv^py  -  WjPi  ^  0 

(2)  fj  >  t. 

In  this  case 

C^lSn)  -  C'“{s‘„)  =  AC"'0')  +  Wj(rj  +  p,)  +  w,(ri  +  p/  +  p,) 

-  Wyllmax  (l  +  Pi),  tj]  +  py)  -  w,(t  +  p,). 

Two  subcases  are  to  be  considered: 

(2a)  rj  ^  t  +  Pi- 

From  this  and  inequality  (1)  it  follows  that 

C’"(s„)  -  C'^{s'„)  =  ^C'^iy)  +  Wi  (rj  +  py  -  /)  >  0 

(2b)  fj  <  t  +  Pi. 

In  this  subcase,  it  is  easy  to  verify  that 

C*{s„)  -  C*(s,J)  =■  AC'Cv)  +  (wy  +  w,)  (fy  -  t)  +  {w,pj  -  WjPi)  >  0 

since  the  first  and  third  terms  of  the  sum  are  nonnegative,  by  (1)  and  assumption  (a),  respectively,  and 
the  second  term  is  positive,  being  /y  >  /. 
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Hence,  in  conclusion,  represents  an  improvement  over  s„.  One  could  repeat  the  interchange 
operation  on  by  shifting  job  i  backward  into  position  y  -  2  and  advancing  the  corresponding  job  into 
position  y  -  1.  In  this  way  the  new  sequence  obtained  is  better  than  s!,. 

By  induction,  it  is  possible  to  repeat  this  operation  on  s'  until  job  ;  reaches  position  k  +  2.  Then 
it  follows  that  a_ partial  sequence  s^hi  dominates  every  other  partial  sequence  s^/iy,  where  j  ^  h  and  i  is 
an  element  of  K. 

To  complete  the  proof  let’s  consider  now  as  schedule  s„  the  one  with  jobs  h  and  /  in  the  (k  +  Dth 
and  {k  +  2)th  positions,  respectively. 

Therefore  s,!  differs  from  s„  in  that  job  ;  is  in  position  k  +  1  and  job  h  in  position  k  +  2. 

In  this  case,  since  t  =  C(*|(s„)  can  be  <  r,,  only  the  completion  times  of  s'„  must  be  rewritten  as 
follows: 

Under  s,: 

=  w^{max(max(/.  r,)  +  /?,,  rj  -I-  p^) 

=  *v,(max(/,  /•,)  +  p,|. 

Three  cases  exist.  Cases  (1)  and  (2)  have  already  been  examined.  Hence,  one  needs  only  to 
analyze  the  following: 

(3)  /  <  r,. 

From  assumption  (b)  and  definition  of  f,,  it  is  also  true  that  r,  <  r/,.  Hence, 

C^isJ  -  C'“{s'„)  =  +  2)  +  +  pi,)  +  +/>»,+  a) 

-  ((max  (tj  +  Pi),  r/,]  +  /?*}  -  iv,  (rj  +  a). 


Two  subcases  are  then  to  be  examined: 

(3a)  ra  >  ^ 

In  this  hypothesis  it  follows  immediately  that 

C'ljJ  -  C*(5;)  =  AC"’(fe  +  2)  +  w,(r*  +  p„-  r,)  >  0 

(3b)  <  r,  +  Pi- 

It  is  easy  to  verify  that 

C^Is,)  -  C"(s^)  =  AC'Uc  +  2)  +  (w;,  +  w,)  (r^  -  r,)  +  (w,p^  -  w^^,)  )>=  0 
since  all  terms  of  the  sum  are  nonnegative  by  (1),  assumptions  (b)  and  (a),  respectively. 

In  conclusion  sh  is  better  than  s„. 

Hence,  since  Skhi  dominates  every  sequence  with  s*/i  fixed  and  always  dominates  Si^hi,  it  fol¬ 
lows  that  i  dominates  h  in  position  A:  -l-  1.  Q.E.D. 

Proof  of  Corollary  1 . 
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It  follows  immediately  from  the  previous  theorem. 

As  far  as  theorems  2,  3,  4,  5  and  Corollary  2  are  concerned,  they  have  been  proved  in  [11]. 

Proof  of  Theorem  6: 

It  can  be  immediately  seen  that,  for  k  =  n,  =  C'"(s„).  Then  it  is  only  necessary  to  prove 

that  for  every  k,  C*(s„U*)  <  C^CsnU^+i). 

C’^(s„|5/()  and  C"'(s„U(,+i)  can  be  expressed  in  the  following  way. 

C^fsnUt+i)  =  C'{s„U*+|)  +  C"(s„|s*+i) 

C*(s„|s*)  =  C'(s„|s*)  +  C"(s„|*) 

where 

n-k-  1 

C'(s„U*+,)  =  C‘"(s*)  +  w*+,c*+,(s*)  +  Y.  '^k  +  \+jCk+]+j(sk+0< 

j-i 

£  (-Snl-S/k+l)”  ^  ^  {8-ltQ+>(S/t  +  |)  —  + 

J~2  l-j+l 

-8.,  [C*+/s*+,)  -  C*+/(s*+|)l)min  (w>*+^, 

n-*-l 

C  (s„|s<.)  =  C  (s/[)  +  vv^.^j C*+i (sy|.)  + 

J-1 

+  Y  (®-i tCk+1  ~  ~  S-i  ~ 

1-2 

•  min(H'*+,,  w*+,); 

C"(i„|5*)=  Y  X  |8-|(Q+y(5;t)  — 

J-2  l-j  +  \ 

-  5-i[C*+y(S;t)  -  C*+/(s*)ll  min(iv*+y,  w^+/). 

Therefore,  we  will  have: 

n—k—  1 

+  —  £'(^nl^t:^  ~  ^  '*'*+1+/ {Q  +  l+y (■**  +  1 )  “  Q+l+y^'S/c^J 

y-l 

—  Y  (S-j (Q+i (s*)  —  r*+/(s*)]  —  6_i[C*+|(s*)  —  C*+/(s*)]). 

1-2 

•  min(w*+,,  w*+,). 

Recalling  that  /  =  y  +  1,  the  generic  term  of  the  difference  can  be  expressed  as  follows 
'*'*+/ lQ+/(s*+|)  ~  Q+z^'S*)!  “  (8-ilQ+i(sn)  —  f*+/(sj)l  + 

-8_|[C;t+i(5*)  -  C*+,(s*)ll  min(H-*+,.  »*+,). 

Let  us  consider  the  possible  cases: 

(a)  Cj+i  (s*)  <  tfi+iisi,) 

(b)  r*+/(s*)  ^  C*+i(s*)  ^  C*+/(s*) 

(c)  C*+|(s*)  >  C*+/{s*) 
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In  case  (a)  it  can  be  immediately  verified  that  the  generic  term  is  equal  to  0,  while  in  case  (b)  and 
(c)  it  can  be  expressed,  respectively,  as  follows; 

(b)  iv*+,[Q+/(s*+,)  -  -  lQ+|(Sjk)  -  r*+,(s*)lmin(w*+,, 

>  0,  since  [C*+;(s*+i)  -  C*+/(s*)l  =  [C*+i(s*)  -  i*+,(s*)]; 

(c)  -  C*+/(s*)l  -  •  min  (»*+!,  ^  0, 

since  [Q+/(s*+i)  -  C*+/(s*)]  >  p^+i- 


Then 

C'(sjs*^,)  -  >  0. 

Let  us  consider  now  the  difference  C "(5„ls*+|)  -  C"(x„ls;t).  Calling 
the  generic  term  -  Ci^,  we  have; 

^k+i  “  (8- 1  fQ+y ~  8_) [Q+j (s*+i) 

-Ck+,(s/t+,)]l  min(iv*+j,  w*+,); 

c*  =  {8_i  [C*+j  (x/t)  —  f*+/(x*)]  —  S-iIC/i+j  (x*)  —  C*+;(x*)]. 
•  min 


The  following  cases  can  arise; 

(a)  f*+/(xjt)  ^  C/(+|(x/() 

(b)  C*.,(s*)  <  4+y  (5*) 

(c)  /*+^(x*)  ^  C^  +  |(x*)  <  /k  +  /(Xn). 


Case  a).  According  to  the  hypothesis 

'k+j^h  +  O  "  >k  +  l^h  +  0  =  Q  + 

and  then 


c*  +  i  =  {Pk+j  ~  8-i(Pk+y  -  /'*+/)!  min(w'*+^.  iv*+,) 


Two  subcases  are  possible; 

(a,)  If  <  Pk^i.  then 

c*  +  i  lmin(w*+^. 

0 

(C*+/xJ-  f^+ilx^llminfiv^+y. 


Ck  = 


Therefore,  c*+|  -  >  0. 

(flj)  If  Pk+j  >  Pk+i-  then 


c*+l  “  p*+/l 

min 

I**'*+y. 

0 

Q  = 

Pk  +  I  { 

min 

I*V*+y. 

»*’*+/ 

[Ck+j 

(x*) 

~  h+i 

(s*)Ji 

-  c* 

>  0. 
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Case  (b).  According  to  the  hypothesis  we  have 
and  then  c*+i  =  c^. 

Case  (c) .  According  to  the  hypothesis  we  have 

^*+>(•**+1)  ^  Ci,+\(sii)  and  /*+/(5*)  =  4+/(s;t+j). 

Three  subcases  exist. 

(ci)  C*+;(^k+i)  4+/(s*+i)  from  which  follows  that 

c*+i  =  Q  ”  0- 

(^2^  >k  +  l^^k+0  ^  ^k+l^h+\^  ■ 

In  this  case 

c*+i  =  [C*+y(5/t+i)  -  r^+zlst+i)]  min(H'*+;, 

=  1® 

I  (C*+y(s*)  -  4+,(s*)]  min  (w^+j.  w*+,)- 
Therefore,  c*+|  -  c*  ^  0. 

(^3^  Q+;(4+i)  <  Q+>fs*+i)- 

In  this  case 

c*+i  =  p*+/{min  (w^+y,  w^+y)) 

0 

C*  =  P*  +  ,  (min(»V*+y,  + 

lc*+y(s*)-  4+/(s*)Jmin(»*'*+y,  m'*+,) 

Therefore,  c*+|  -  >  0. 

Then,  also,  C"(s„U*+i)  -  C"(s„|s*)  >  0  and  hence,  in  general,  C"'(s„U*+i)  >  C^'Isnls/t),  prov¬ 
ing  that  C”  (s„|s*)  is  a  lower  bound  for  (5„|s*). 

Proof  of  Theorem  7. 

First,  we  show  that  C|*|(5®)  <  C[*j(s“).  I  ^  k  ^  n.  To  this  end  let’s  assume  that  the  ready 
times  are  modified,  defining  rj  =  max(ry,  r,),  V  j  €  N. 

This  assumption  will  not  affect  the  start  or  completion  time  of  any  job  in  s“  or  s“. 

Let’s  construct  now  a  new  sequence  s!,,  reordering,  for  a  given  k,  the  first  /c  jobs  in  s“  according 
to  their  earliest  start  times  (EST)  defined  on  the  basis  of  ready  times  rj.  If  more  jobs  have  the  same 
EST,  priority  is  given  to  the  job  with  the  smallest  completion  time  among  them. 

Let’s  call  the  partial  sequence  so  obtained  s*  and  the  complete  sequence  s'„\  clearly 
C[*i(s;)  <  ci*,(s^). 
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Since,  by  properties  (a)  and  (b),  is  an  ECT  and  EST  sequence  of  the  modified  set  N,  ii  follows 
that,  at  the  first  position  /  (I  ^  <  /c)  in  which  C(,|(s“)  ^  C|;|(s^),  it  must  be  true  < 

r|,|(s;)  ?  si 

Hence,  replacing  y'j'/i  with  yf/j  will  not  result  in  an  increase  of  C\i](.s'„),  I  ^  I'  ^  k. 

If  this  replacement  causes  jobs  in  following  the  new  jy]  to  lose  the  EST  sequencing  property,  it 
is  possible  to  reorder  the  set  by  EST  without  increasing  C[*i(s^),  since  C[i](si  has  been  reduced.  In 
this  manner  is  redefined  to  represent  the  new  sequence.  Repeating  this  procedure  for  all 
I’d  <  /'  <  comparing  C(,|(s®)  and  Cp](s^)  and  making  necessary  changes,  as  shown  above,  will 
produce  a  sequence  identical  to  the  partial  sequence  of  the  first  k  jobs  in  s°,  without  increasing 
C|*)(s®).  Thus,  C|;t)(5“)  <  C(*)(s^)  ^  C(n(s“),  1  ^  k  <  rt,  so  proving  the  first  statement.  To  com¬ 
plete  the  proof,  remember  that,  by  definition, 

/c-l 

and 

C'Cs?)  =  £  Wi*jCui(5?). 

*-i 


It  is  well  known  that  such  a  sum  of  pairwise  products  of  two  sequences  of  numbers  will  be  minim¬ 
ized  if  one  sequence  is  arranged  in  increasing  order  and  the  other  in  decreasing  order.  Since  the  com¬ 
pletion  times  C(*i  are  already  in  increasing  order,  the  minimization  of  C"  is  accomplished  if  sequencing 
is  such  that  the  weights  W(;^|  are  in  decreasing  (or  at  least  nonincreasing)  order.  Therefore,  being  si 
and  s"  sequences  defined  on  the  same  set  N  ,  from  assumption  (c)  and  previous  results  obtained,  it  fol¬ 
lows  that 

C-'Cs®)  <  C'“(s^) 
which  proves  the  theorem. 

Proof  of  Corollary  3. 

If  follows  immediately  from,  the  previous  theorem. 
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ABSTRACT 

This  exposilion  presents  a  method  for  incorporating  a  technique  known  as 
"splitting  the  bump"  within  an  elimination  form  reinversion  algorithm.  This 
procedure  is  designed  to  reduce  fill-in  during  reinversion  and  should  improve 
the  efficiency  of  linear  programming  systems  which  already  use  the  superior  el¬ 
imination  form  of  the  inverse. 


1.  INTRODUCTION 

Current  production  linear  programming  systems  are  designed  to  handle  problems  with  8000  to 
16,000  rows  and  1000  row  problems  are  considered  to  be  medium  sized  (see  [10,  17)).  Fortunately, 
real  problems  tend  to  be  sparse  (the  density  of  the  constraint  matrix  is  often  less  than  1%  — see  [1.  2,  9, 
14,  151).  Hence,  a  basis  for  a  1000  row  problem  may  have  only  (1000)  (1000)  (1%)  =  10,000  nonzero 
entries.  However,  the  inverse  (which  is  required  for  the  revised  simplex  algorithm)  may  be  quite 
dense  having  almost  a  million  nonzero  elements.  Consequently,  one  of  the  most  important  design  con¬ 
siderations  for  a  computer  implementation  of  the  revised  simplex  algorithm  for  general  linear  programs 
is  the  technique  used  to  maintain  and  update  the  basis  inverse. 

In  order  to  minimize  the  storage  required  to  implement  this  algorithm,  production  linear  program¬ 
ming  systems  maintain  the  inverse  of  the  basis  in  either  product  or  elimination  form  (see  [1,  3,  8,  11, 
12,  I7J).  Computationally,  the  inverse  is  stored  as  a  sequence  of  vectors  known  as  eta  vectors,  and  the 
complete  list  of  the  eta’s  is  called  the  ETA  File.  Each  basis  change  (pivot)  results  in  appending  at  least 
one  eta  vector  to  the  ETA  File.  Since  both  the  time  per  pivot  and  numerical  error  increase  as  the 
length  of  the  ETA  File  increases,  it  is  necessary  to  periodically  obtain  a  new  factorization  of  the  basis 
inverse.  This  process  of  obtaining  a  new  factorization  is  called  reinversion.  The  objective  of  a  reinver¬ 
sion  algorithm  is  to  obtain  a  factored  inverse  (i.e.,  ETA  File)  in  which  the  sparsity  characteristics  of  the 
original  basis  are  preserved  as  much  as  possible. 

The  simplest  reinversion  technique  for  a  given  m-column  basis  can  be  thought  of  as  successively 
reducing  the  basis  to  an  identity  matrix  via  m  pivot  operations.  The  matrices  which  accomplish  this 
reduction  constitute  the  ETA  File.  Out  of  this  simple  approach,  reinversion  techniques  have  evolved 
which  attempt  to  obtain  a  sparse  factorization  by  selection  of  pivot  positions,  involving  a  reordering  of 
the  columns  of  the  basis  and  a  conceptual  reordering  of  the  rows.  In  addition,  a  technique  known  as 
splitting  the  bump,  18,  131,  has  been  found  beneficial  when  used  within  a  product  form  reinversion 
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algorithm.  Recently,  however,  computational  evidence  indicates  that  the  elimination  form  factorization 
is  superior  to  the  product  form  (with  respect  to  both  storage  requirements  and  speed  (see  (21).  The 
objective  of  this  exposition  is  to  present  a  reinversion  algorithm  in  which  one  may  achieve  some  of  the 
benefits  of  the  "splitting  the  bump  technique"  when  using  the  elimination  form  of  the  inverse. 

For  this  note,  the  rth  column  of  the  m  x  m  matrix  B  is  denoted  by  B(i).  The  symbol  e'  denotes 
the  m-component  column  vector  having  ith  component  one,  and  all  other  components  zero.  The  sym¬ 
bol  17  denotes  an  m-component  column  vector  and  i),  denotes  the  rth  component  of  this  vector. 

2.  FACTORIZATION  ALGORITHMS 


Let  B  be  any  m  by  m  nonsingular  matrix.  In  this  section  we  present  two  algorithms  for  obtaining 
a  factorization  of  The  first  algorithm  produces  a  product  form  factorization  which  corresponds  to 
the  method  for  solving  a  system  of  linear  equations  known  as  Gauss-Jordan  reduction  while  the  second 
algorithm  produces  an  elimination  form  factorization  which  corresponds  to  a  Gauss  reduction  (see  [4]). 


By  row  and  column  interchanges,  B  may  be  placed  in  the  following  form,  where  B'  and  B^  are 
lower  triangular  matrices  with  nonzeroes  on  their  diagonals: 


(1) 


B*  \  B^ 

B*‘ 

B^ 

B^ 

We  assume  that  if  B^  is  nonvacuous,  every  row  and  column  has  at  least  two  nonzero  entries,  so  that  no 
rearrangement  of  B^  can  expand  the  size  of  B'  or  B^.  B^  is  called  the  bump,  merit,  or  heart  section. 
We  further  require  the  heart  section  to  assume  the  following  form  where  (7*’s  are  either  vacuous  or 
lower  triangular  with  nonzeroes  on  the  diagonal 


The  only  partitions  in  B  having  columns  with  nonzeroes  above  the  diagonal  are  the  F^’s  which  are 
called  external  bumps.  The  columns  extending  above  the  diagonal  are  called  spikes  or  spike  columns. 
An  external  bump  is  characterized  as  follows; 
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(i)  the  last  column  of  an  external  bump  will  be  a  spike  with  a  nonzero  lying  in  the  topmost 
row  of  the  external  bump,  and 

(ii)  the  nonspike  columns  have  nonzero  diagonal  elements. 

The  algorithms  of  Hellerman  and  Rarick  (6,  7]  produce  such  a  structure  for  any  nonsingular  matrix  B, 
and  we  shall  call  a  matrix  having  this  structure  an  HR  matrix. 

The  product  form  factorization  algorithm  for  an  HR  matrix  is  given  as  follows: 

ALGORITHM  1:  PRODUCT  FORM  FACTORIZATION  FOR  A  HR  MATRIX 
0.  Initialization 


Set  /  —  1  and  B  5(1). 

1.  Obtain  New  Eta 


Set  T);,  ^ 

and  set  E‘(k)  *— 


1/5,,  for  k  =  i 

otherwise, 

T),  for  k  =  / 
c*,  otherwise. 


(Note:  only  t)  and  i  need  be  saved  in  the  ETA  File  rather  than  the  matrix  E') 


2.  Test  For  Termination 


If  /  =  m,  terminate,  otherwise,  /  —  /  +  1. 

3.  Test  For  Spike 

If  Bii)  is  not  a  spike,  set  5  Bii)  and  go  to  1. 

4.  Spike  Update 

Let  B(k)  correspond  to  the  first  column  of  the  external  bump  containing  Bii).  Set 
5  —  ...  E'^Bii). 

5.  Swap  Spikes  If  Pivot  Element  Equals  Zero 

If  5,  0,  go  to  1;  otherwise,  there  is  some  spike  BiJ)  in  the  same  external  bump  having 

j  >  i  such  that  the  fth  component  of  . . .  E'^Bij)  is  nonzero.  Set  5  —  . . .  £*50'), 

interchange  Bii)  and  Bij)  and  go  to  1. 

In  practice,  the  test  for  a  zero  pivot  element  in  Step  5  is  usually  replaced  by  a  tolerance  test.  Let 
TOL  denote  the  tolerance  to  be  used  in  the  test.  If  l5,  l  ^  TOL,  then  5,  is  treated  as  if  it  is  zero.  Dis¬ 
cussions  of  tolerances  may  be  found  in  [12,  15,  16).  Similar  tolerance  tests  would  ordinarily  be  incor¬ 
porated  in  the  other  algorithms  to  be  presented  in  this  exposition.  To  simplify  the  presentation  they 
have  been  omitted  here. 

Justification  for  Algorithm  1  is  given  in  [5].  At  termination,  5”'  =  ...£'.  Further¬ 

more,  we  see  that  for  nonspike  columns,  each  5  and  consequently  each  rj  has  exactly  the  same  number 
of  nonzeroes  as  the  corresponding  column  of  5.  However,  the  tj  for  a  spike  column  may  have  a  higher 
density  than  the  original  column  of  5.  The  phenomena  of  an  tj  having  a  higher  density  than  the 
corresponding  column  of  5  is  known  as  fill-in. 


VOL.  29.  NO  I,  MARCH  1982 


NAVAL  RESEARCH  LOGISTICS  QUARTERLY 


1 72  R,  V.  HELGASON  AND  J.  L.  KENNINGTON 

It  is  well  known  that  fill-in  may  be  reduced  by  applying  an  elimination  form  factorization  rather 
than  the  product  form  (see  Ill).  The  elimination  form  algorithm  for  an  HR  matrix  is  given  as  follows: 

ALGORITHM  2:  ELIMINATION  FORM  FACTORIZATION  FOR  A  HR  MATRIX 


0.  Initialization 

Set  /■  *—  1.  r  *—  0,  and  13  B(l). 

1 .  Obtain  New  Lower  Eta 

l/Bi,  for  k  =  / 

Set  T)*  ^  {  -/3*//3„  for  k  >  i 
0,  otherwise. 


and  set  L‘{k) 


T),  for  k  ~  i 
e*,  otherwise. 


(Note:  only  tj  and  i  need  be  saved  in  the  ETA  File  for  the  lower  factors) 
2.  Test  for  Termination 


If  /  =  m,  terminate;  otherwise,  i  —  i  +  1. 

3.  Test  for  Spike 

If  B(.i)  is  not  a  spike,  set  /3  <—  B{i)  and  go  to  1. 

4.  Spike  Update 

Let  B(k)  correspond  to  the  first  column  of  the  external  bump  containing  Bii).  Set 
B  —  ...  L'^Bii). 


5.  Swap  Spikes  if  Pivot  Element  Equals  Zero 

If  Bi  9^  0,  go  to  6;  otherwise,  there  is  some  spike  B(j)  in  the  same  external  bump  having 
j  >  i  such  that  the  /th  component  of  L'~’  . . .  L'‘B(J)  is  nonzero.  Set  jS  —  . . .  L'‘B(j) 

and  interchange  Bii)  and  Bij). 

6.  Obtain  New  Upper  Eta 


Set  r  —  r  +  \ 


Set 


set 


Vk 


1,  for  k  =  i 
-Bk-  for  k  <  i 
0,  otherwise. 


77,  for  A:  -  / 

U'ik)  ^  k 

e”,  otherwise. 


and  go  to  1 . 
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(Note:  only  tj  and  i  need  be  saved  in  the  ETA  File  for  the  upper  factors).  At  the  termination  of 
Algorithm  2,  =  f/,  . . .  l/'L'”  where  the  upper  eta’s  are  upper  triangular  and  the  lower  eta’s 

are  lower  triangular.  As  with  Algorithm  1  fill-in  has  been  restricted  to  the  eta’s  corresponding  to  spike 
columns. 


3.  SPLITTING  THE  BUMP 


In  an  attempt  to  reduce  the  fill-in  which  occurs  in  spike  columns  during  a  reinversion  using  Algo¬ 
rithm  1,  a  variation  of  the  elimination  form  factorization  algorithm  (attributed  to  Martin  Beale,  see 
[13])  has  been  used  by  some  practitioners.  This  technique  has  been  called  "splitting  the  bump"  after  its 
treatment  of  external  bump  columns. 


Consider  a  set  of  columns  of  the  basis  corresponding  to  an  external  bump,  say 

ol 


rows 


I" 

I  p  rows 


where  F  is  an  external  bump.  Suppose  F  contains  q  spikes.  If  we  apply  the  standard  product  form  algo¬ 
rithm,  we  obtain  a  set  of  eta’s  such  that 


£"...  E'B^ 


]n  . 


The  eta’s  corresponding  to  the  q  spikes  may  incur  fill-in.  If  we  split  the  bump,  the  fill-in  can  be  res¬ 
tricted  to  the  n  rows  corresponding  to  £;  hence,  we  may  avoid  fill-in  in  the  last  p  rows.  Since  p  may  be 
much  larger  than  n,  the  savings  could  be  substantial.  The  price  which  must  be  paid  to  attain  this  reduc¬ 
tion  in  fill-in  is  that  each  external  bump  will  require  2n  +  q  eta’s  rather  than  n.  The  savings  in  fill-in  is 
typically  so  great  that  it  offsets  the  additional  storage  which  must  be  given  up  for  the  additional  eta’s. 


The  product  form  algorithm  incorporating  the  "splitting  the  bump"  technique  is  given  as  follows: 


ALGORITHM  3.  PRODUCT  FORM  FACTORIZATION  FOR  A  HR  MATRIX 
INCLUDING  SPLITTING  THE  BUMP 


0.  Initialization 

Set  /  —  1,  y  —  1,  and/3  —  Bil).  If  B(\)  is  in  an  external  bump,  go  to  4. 

1.  Obtain  New  Lower  Eta 

l//3i,  for  k  =  I 


1  -pjBi,  otherwise, 

and  set  EHk) 


T),  for  k  =  / 
e*.  otherwise. 


Set  J  —  J  +  \. 
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2.  Test  for  Termination 

If  /  “  m,  terminate;  otherwise,  /  —  /  +  1  and  /3  —  B(i). 

3.  Test  for  External  Bump 

If  B(/)  is  not  the  first  column  in  an  external  bump,  go  to  1. 

4.  Initialization  for  Bump 

/  J  (current  length  of  Eta  File) 
s  —  /  (first  column  in  this  external  bump) 

/>  number  of  columns  in  this  external  bump 
/  —  /■  +  -  1  (last  column  in  this  external  bump) 

5.  Obtain  Lower  Eta 

l/0i,  for  k  =  / 

Set  rii^  —  for  i  <  k  t 

0,  otherwise, 

7).  fot  Ir  “  i 

and  set  EHk)  —  *  .. 

e  ,  otherwise. 

Set  J  —  j  +  1 . 

6.  Test  for  End  of  Bump 

If  /  =  t,  go  to  10,  otherwise,  /  —  /  +  1. 

7.  Test  for  Spike 

If  B(i)  is  not  a  spike,  set  /3  *—  B(i)  and  go  to  5. 

8.  Spike  Update 

Sel^  —  ...  E'BU). 

9.  Swap  Spikes  if  Pivot  Element  Equals  Zero 

If  B,  ^  0,  go  to  5;  otherwise,  there  is  some  spike  B(r)  in  the  same  external  bump  having 
r  >  i  such  that  the  ith  component  of  . . .  E'Bir)  is  nonzero.  Set  0  —  . . .  E'B(r), 

interchange  Bfr)  and  B{i)  and  go  to  5. 


10.  Obtain  Upper  Eta 

1,  for  k  =  i 

Set  Tfk  —  -Bk'  foi"  *  <  ' 
0,  otherwise, 


and  set  EHk) 


Tj,  for  k  =■  i 
e*,  otherwise. 


Set  j  —  +  I  and  /  *—  /  —  1 . 
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11.  Test  for  Beginning  of  Bump 

If  /  =  s,  set  j8  •-  fl  (/)  and  go  to  13. 

12.  Test  for  Spike 

If  B(i)  is  not  a  spike,  set  ;  —  1  and  go  to  11;  otherwise,  set  p  *—  . . .  E'Bii)  and 

go  to  10. 

13.  Obtain  Lower  Eta 


Set  rj* 


1.  for  k  =  /■ 
-/3*,  for  it  >  f 
0,  otherwise. 


and  set  EHk) 


1),  for  ^  =  /■ 
e*,  otherwise. 


Set  j  j  +  \. 


14.  Test  for  End  of  Bump 

If  i  =  r,  go  to  2;  otherwise  i  *-  i  +  1,  set  p  *—  5(/),  and  go  to  13. 

At  the  termination  of  Algorithm  3,  B~'  =  E^~^  ...EK  Furthermore,  all  eta’s  are  either  upper  triangu¬ 
lar  or  lower  triangular,  but  they  are  intermixed. 

Recall  that  Algorithm  2  produces  a  set  of  lower  triangular  factors  followed  by  a  set  of  upper  tri¬ 
angular  factors.  If  one  incorporates  the  "splitting  the  bump"  technique  into  Algorithm  2  and  applies  it 
to  an  HR  matrix,  the  lower  and  upper  factors  become  intermingled.  Once  the  factors  have  become 
intermingled,  we  may  no  longer  use  the  important  algorithm  of  Forrest  and  Tomlin  [2]  to  maintain  the 
elimination  form.  We  now  show  how  one  may  achieve  a  partial  "splitting  of  the  bump"  while  simul¬ 
taneously  maintaining  a  partitioning  of  the  upper  and  lower  factors. 

Recall  that  an  HR  matrix  takes  the  form  given  in  (1). 

By  a  rearrangement  of  rows  and  columns,  the  HR  matrix  (1)  may  be  placed  in  the  following  form: 


B^ 

B^ 

B^ 

B' 

B* 

B^ 

u  V  w 
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where  fl'’  and  are  row  permutations  of  B*'  and  fi^_respect^vely,  and  B^  is  ^row  and  column  permu¬ 
tation  of  B^.  Applying  a  variation  of  Algorithm  2  io  B  eliminates  all  fill-in  in  B^  while  representing  5"' 
as  a  product  of  upper  factors  followed  by  a  product  of  lower  factors.  The  details  are  given  in  the  fol¬ 
lowing  algorithm. 


ALGORITHM  4:  ELIMINATION  FORM  FACTORIZATION  FOR  A  MODIFIED  HR  MATRIX 
0.  Initialization 

Set  /  *-  M  -I-  1  and  /3  —  BU).  If  v  =  0,  go  to  3. 


1.  Obtain  New  Lower  Eta 


Set 


1//3,,  for  k  =  ;■ 

fof  k  >  i 
0,  otherwise. 


2. 


3. 


and  set  L'(k) 


ri,  for  k  =  I 
e'‘,  otherwise. 

Test  for  End  of  Section  2 


Set  I  ~  /+  I  and  /3  — •  B(/).  If  /  <  t/  v,  go  to  1.  If  w  =  0,  go  to  8. 
Obtain  New  Lower  Eta 


Set  T)*  — 


1//3/,  for  k  -  i 
-pjpi,  for  k  >  i 
0,  otherwise, 


and  set  L‘(k) 


7},  for  k  =  i 
otherwise. 


4.  Test  for  End  of  Section  3 


If  /  =  m,  go  to  8;  otherwise,  i  i  +  I. 

5.  Test  for  Spike 

If  B(i)  is  not  a  spike,  setjS  —  BU)  and  go  to  3. 

6.  Spike  Update 

Set/3  —  L'-'  ...  L‘‘*''^'BU). 

7.  Swap  Spikes  if  Pivot  Element  Equals  Zero 

If  B,  ^  0,  go  to  3;  otherwise,  there  is  some  spike  B{j)  in  the  same  external  bump  having 
j>  i  such  that  ^he  rth  component  of  _Z'“' . . .  fl(/)  is  nonzero.  Set 

/3  —  L'"'  ...  50')>  interchange  BU)  and  BiJ)  and  go  to  3. 
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8.  Obtain  New  Upper  Eta 

1,  for  k  =  i 

Set  T)*  *-  -Pk’  for  *  <  ' 

0,  otherwise, 

T/,  for  k  =  i 

and  set  )  ^  | 

9.  Check  for  End  of  Section  2 

.  Set  /  >—  /  -  1.  If  I  =  0,  terminate. 

If  I  =  IV,  set  /3  —  B(i)  and  go  to  12. 

10.  Set  Column 

If  /  >  u  +  r,  set  /3  —  L' . . .  BU):  otherwise, 

set  B  B(i).  Go  to  8. 

11.  Obtain  New  Upper  Eta 

\/Bi.  for  k  =  i 

Set  rik  —  -Bklfii-  for  k  <  i 
0,  otherwise, 

t],  for  k  =  i 

and  set  ^  |  otherwise. 

12.  Check  for  Termination 

If  /  =  1,  terminate;  otherwise /3  *—  B{i)  and  go  to  11. 

At  the  termination  of  Algorithm  4,  =  (/’...  U'”L'"  ...  C"^',  and  the  fill-in  has  been  restricted  to 

B^.  Hence,  Algorithm  4  gains  some  of  the  benefits  of  the  "splitting  the  bump"  technique  while  main¬ 
taining  a  partitioning  of  the  upper  and  lower  factors.  The  benefits  are  not  as  great  as  with  ordinary 
"bump  splitting"  since  each  individual  external  bump  is  split,  whereas  here  the  split  is  with  respect  to 
the  entire  heart  section. 

A  variation  of  Algorithm  4  has  also  been  used  by  Tomlin  (15).  Our  contribution  is  that  we  have 
tied  together  the  ideas  of  "splitting  the  bump"  in  both  the  product  and  elimination  form  factorizations 
and  we  have  explicitly  indicated  via  Algorithms  3  and  4  how  these  may  be  implemented. 
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ABSTRACT 

This  note  considers  a  recently  proposed  solution  method  for  a  multifacility 
location  problem.  It  is  shown  that  the  method  does  not  always  produce  an  op¬ 
timal  solution. 


In  a  recent  issue  of  this  journal  Calamai  and  Charalambous  [I]  conclude  that  many  existing 
methods  designed  for  solving  the  multifacility  location  problem  are  either  poorly  structured,  subop- 
timal,  or  haphazard  (p.  619).  It  should  be  pointed  out  that  their  proposed  algorithm  belongs  to  the 
same  class  of  methods,  in  the  sense  that  the  algorithm  sometimes  gives  a  suboptimal  solution. 

The  authors  base  their  algorithm  on  a  set  of  necessary  conditions  and  seem  to  believe  that  this  set 
of  conditions  is  also  sufficient  for  optimality,  although  sufficiency  has  yet  to  be  proven  (p.  617).  In  a 
problem  involving  rectilinear  distances  the  necessary  conditions  are  also  sufficient  [3].  This  result  holds 
for  Euclidean  distances,  too,  as  long  as  each  case  of  facility  coincidence  involves  just  two  facilities  [2]. 
But  if  a  current  solution  to  a  problem  involving  Euclidean  distances  has  three  or  more  facilities  located 
at  the  same  point,  the  author’s  set  of  necessary  conditions  are  not  in  general  sufficient  for  optimality,  as 
shown  by  the  following  example. 

Consider  a  current  solution  where  new  facilities  1,  2,  and  3  occupy  the  same  location  and  no  other 
facility  occupies  this  location.  Let  the  given  weights  relating  to  these  new  facilities  be  Vjj  =  1,  v,3  =  1, 
and  V23  =  5’^^  -  1.  Suppose  the  other  weights  and  facility  locations  are  such  that  the  pseudo-gradients 
in  (4.2c)  (p.  613)  are  G]  =  (2.  0),  Gj  =  (0,  1),  and  Gj  =  (—2,  -1).  Now  consider  the  necessary  con¬ 
ditions  for  optimality  in  (5.3)  (p.  613).  When  5  contains  one  element,  the  left  hand  sides  of  the  condi¬ 
tion  are  2,  1,  and  5'^^  whereas  the  right  hand  sides  are  2,  5*''^  and  5'''^.  Hence,  moving  one  of  the 
new  facilities  1,  2,  or  3,  as  done  in  step  (3)  (p.  615)  of  the  authors’  algorithm,  does  not  yield  a  better 
solution.  Moving  the  entire  cluster  of  these  new  facilities  is  likewise  unsuccessful,  since  the  sum  of  the 
pseudo-gradients  is  (0,  0).  Finally,  when  5  contains  two  elements,  the  conditions  for  S  containing  one 
element  are  repeated  so  moving  a  subset  cluster,  as  done  in  step  (5)  of  the  algorithm,  would  not  yield  a 
better  solution. 

Summarizing,  the  example  represents  a  situation  where  the  algorithm  will  stop,  because  the 
necessary  conditions  for  optimality  are  satisfied.  But  the  current  solution  is  suboptimal,  because  the 
directional  derivative  is  not  nonnegative  for  all  directions.  To  verify  this  consider,  for  instance,  a  direc¬ 
tion  vector  D  where  the  only  nonvanishing  two-dimensional  component  vectors  are  D|  —  (-4,  -1)  and 
D  ”  (-2,  -1).  (D  can  be  normalized  into  a  unit  vector  by  using  the  factor  22’''^  if  desired.)  The 
directional  derivative  at  the  current  solution  in  this  direction  is  given  by  (4.1)  (p.  612),  which  yields; 
-8-  1  -l-O-l-  2-f  17'''^-!-  (5-  5’^^),  which  is  negative.  Thus,  a  better  solution  can  be  achieved  by  keeping 
new  facility  3  fixed,  and  by  moving  new  facilities  1  and  2,  not  as  a  subcluster,  but  along  the  separate 
directions  indicated  above. 
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The  possibility  that  facilities  may  coincide  renders  the  objective  function  in  the  multifacility  loca¬ 
tion  problem  nondifferentiable,  and,  thus,  difficult  to  handle  computationally.  Until  a  satisfactory 
nondifferentiable  optimization  technique  has  been  developed,  decision  makers  should  continue  to  use  a 
differentiable  approximation  to  their  problem  (unless  the  distances  are  rectilinear).  The  new  algorithm 
is  of  dubious  value  to  a  decision  maker  interested  in  an  optimal  solution  to  the  problem. 

Finally,  it  might  be  pointed  out  that  problem  6  (p.  618)  can  be  solved  by  inspection  using  corol¬ 
lary  2  and  corollary  1  in  (4).  The  results  in  this  reference  offer  the  potential  for  reduction  of  practical 
problems  by  exploiting  the  special  weight  structure. 
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